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Abstract 

Quantum theory shows many surprising features like the uncertainty 
principle, entanglement or nonlocality. In order to understand these fea- 
tures, several attempts have been made to formulate quantum theory 
within a more general framework of probabilistic theories. Such a frame- 
work allows to formulate postulates and study their consequences in a 
general setting. In the past, generalized probabilistic theories have mostly 
been studied to understand the nonlocality of quantum theory. 

This thesis approaches quantum theory from a different perspective. 
It is dedicated to the study of the consequences of postulates concerning 
post-measurement states. This aspect of generalized probabilistic theories 
has gained very little attention in the literature so far. As the main 
result of this thesis, we show that one very simple postulate rules out 
all probabilistic theories with a polytopic state space except for classical 
theory. This postulate states that if the outcome of a measurement can be 
predicted with certainty, then this measurement does not alter the state, 
i.e. the post-measurement state coincides with the initial state. Since this 
postulate is satisfied by quantum theory, this result gives a partial answer 
to the question which physical principles distinguish quantum theory from 
other probabilistic theories. 

To develop an understanding of this main result and of generalized 
probabilistic theories in general, we give an introduction to a framework 
which has been called the abstract state space formalism. Such an intro- 
duction has not been provided by the literature so far. This comprises 
the proof of many properties of convex sets and abstract state spaces. In 
particular, the characteristics of physical theories with a polytopic state 
space are investigated. As a side result, we show that within polytopic 
theories, classical theory can be characterized by three postulates which 
we will call repeatability, the subspace principle and the state discrimina- 
tion principle. 
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1 Introduction 

1.1 The role of generalized probabilistic theories 

Quantum theory has many physical features which, from an everyday life point 
of view, are very surprising: Heisenberg's uncertainty principle, the superposi- 
tion principle, entanglement, nonlocality and contextuality, to name but a few. 
When a physicist is asked for an explanation of these features, the only hon- 
est answer he can give is that they arise from the mathematical structure of 
quantum theory. He can mention a few comparatively weak motivations for the 
mathematical framework of quantum theory, but he cannot fully derive it from 
physical principles: he cannot say why quantum theory is the way it is. 

To get a better understanding of this situation, it is helpful to compare 
quantum theory to a physical theory which does not have this problem. Special 
relativity can be treated in two different ways. The first way is to start with the 
mathematical definition of the Minkowski spacetime. Then all physical features 
of special relativity, like the frame-independence of the speed of light and the 
principle of relativity, arise from this mathematical structure of spacetime. This 
way of dealing with special relativity is comparable to the way in which quantum 
theory is treated. The starting point of the theory is a mathematical framework, 
and physics is deduced from this mathematical structure. The advantage of this 
approach is that it is a very clear way of formulating a theory. This is, however, 
not the natural way of deriving a physical theory, and historically, this is not 
the way special relativity was discovered. The second, more natural way to deal 
with special relativity is to start with physical postulates. In this approach, 
there are no initial assumptions about the specific mathematical structure of 
spacetime. Instead, the mathematics of special relativity are derived from the 
physical postulates of the invariance of the speed of light and the principle of 
relativity. The advantage of this approach is that all explanations of physical 
features of the theory can be based on physical assumptions that underlie the 
theory. 

The unsatisfactory characteristic of quantum theory is that it lacks such 
a second approach. There is no commonly accepted complete derivation of 
the mathematical structure of quantum theory, based on undeniable, purely 
physical postulates. In the recent past, several attempts to fill this gap have 
been made, e.g. jHarOlj [MMllj [CDPllj . They provide interesting insights 
concerning the question which aspects of quantum theory could be regarded as 
being fundamental, and they might be a big step towards a physical derivation 
of the mathematics of quantum theory. However, in seeking a full derivation 
of quantum theory, the assumptions that are made to achieve this goal are not 
beyond any doubt. They exhibit several deficits: some of them are more of a 
mathematical than of a physical nature, others seem to be rather arbitrary and 
unmotivated, and others again are very strong, assuming far-reaching principles 
instead of deriving them from weaker assumptions. 

Instead of seeking a full derivation of quantum theory, interesting insights 
in partial aspects of quantum theory can be gained by considering less powerful 



assumptions about physical theories which are not intended to imply the frame- 
work of quantum theory. A way of thinking which has become more and more 
important in this issue is to consider quantum theory, or physical theories in 
general, from an information theoretical point of view. This approach suggests 
considering quantum theory in a broader context of probabilistic theories. The 
idea is that one should start with as few assumptions about the concrete nature 
of the theory as possible, assuming only that the theory is probabilistic. This 
means that in such a theory, the combination of a state and a measurement is 
not enough to predict a measurement outcome with certainty. Instead, it gives 
a probability distribution over the outcomes. This gives rise to a comparatively 
weak mathematical structure. This mathematical framework has occasionally 
been called the framework of generalized probabilistic theories. Using this prob- 
abilistic framework as a basis, one can then make further assumptions about 
the concrete properties of the theory. 

Generalized probabilistic theories have been considered in different contexts, 
provided with different additional structures depending on which particular as- 
pect of quantum or classical theory is investigated. For example, one aspect 
of quantum mechanics which attracts much attention is nonlocality. A simple 
example of a theory in the framework of generalized probabilistic theories which 
is dedicated to the study of nonlocality is the theory of the PR-box (named after 
Popescu and Rohrlich), also called nonlocal box |PR94j . In this context, the 
additional structure under investigation is the mathematical rule of assigning 
multipartite state spaces and measurements to systems of multiple constituents. 
This allows for the study of features like steering and teleportation in a more 
general setting. Other aspects under consideration are uncertainty relations and 
entropy measures or the possibility of cryptographic and information processing 
tasks. 

1.2 The ideas of this thesis 

The ideas presented in this thesis arose from the attempt to describe consecutive 
measurements (and therefore post-measurement states) in generalized proba- 
bilistic theories. This aspect has gained little attention so far. The motiva- 
tion for a further investigation of this aspect is that it seems that there is no 
straightforward definition of post-measurement states in generalized probabilis- 
tic theories. In fact, it turns out that requiring rather simple conditions about 
post-measurement states rule out a broad class of generalized probabilistic the- 
ories already. 

The focus of this thesis is on a particular class of generalized probabilistic 
theories which we call polytopic theories. There are two good reasons for re- 
stricting ones attention to polytopic theories. The first reason is that they are 
technically easier to deal with. Another reason which makes this class of theories 
attractive to deal with is the fact that most toy theories that have been "in- 
vented" for the study of generalized probabilistic theories belong to this class. A 
shortcoming of this restriction, however, is that quantum theory is not a poly- 
topic theory. Nonetheless, all properties of generalized probabilistic theories 



considered in this thesis are satisfied by quantum theory. Therefore, quantum 
theory belongs to a subclass of generalized probabilistic theories satisfying these 
properties, whereas most polytopic theories do not. 

We present the principles considered in this thesis in two groups. The first 
group consists of three principles. The first is that measurements are repeat- 
ablejll This means that if we perform a measurement twice (and we do not 
assume any intermediate dynamics which is not due to the measurement), then 
we will get the same outcome. The second principle of this group assumes 
that the set of all possible states after a measurement shows a certain subspace 
structure. The third principle is what we call the state discrimination principle. 
Roughly speaking, it states the following. Suppose that Ai and A2 are sets of 
states. Assume that we can perfectly distinguish Ai from A2 by a measure- 
ment. In addition, assume that two subsets A3, A4 C A2 are such that we can 
also perfectly distinguish A3 from A4 by a measurement. The state discrimi- 
nation principle states that in this case, we can perfectly distinguish between 
the sets Ai,A3 and A4 by a measurement. We will show that the only poly- 
topic theories which obey these three principles have a simplex structure, and 
therefore coincide with classical theory. 

The second group of principles consists of only one very simple principle. 
It states that if we know the outcome of a particular measurement in advance 
with certainty, then we can perform this measurement without altering the 
statistics of any subsequent measurement. In other words, if the state of a 
system has probability one for an outcome of a particular measurement, then 
performing this measurement does not disturb the state of the system: the 
post-measurement state of the system coincides with the initial state. As the 
main result of this thesis, we will show that surprisingly, this seemingly weak 
assumption rules out all polytopic theories except for the classical theories (i.e. 
the theories where the states form a simplex). 

1.3 Overview 

This thesis is organized in two parts. In Part HI we give an introduction to 
convex sets and to generalized probabilistic theories and we develop most of the 
techniques that we use in this thesis. In Part UH we apply these techniques to 
infer the results of this thesis. 

Part U is structured as follows. Section [5] is an introduction to the mathe- 
matics of convex sets and their interpretation in physical theories. Although we 
also talk about convex sets in general, we will particularly focus on the study of 
polytopes, since this is the kind of convex set which will be important in Part 
Ull of this thesis. We will infer many properties of polytopes which will be im- 
portant in the proofs of the results. Section [3] is an introduction to generalized 
probabilistic theories. We will introduce the mathematics necessary to treat 
physical theories in a generalized probabilistic framework. Then we will infer 



1 More precisely, we only assume repeatability for a particular type of measurements which 
we call pure measurements. For more details, see Sections \3\ and [4] 



a particular framework for generalized probabilistic theories which is called the 
abstract state space formalism. This framework will be illustrated by examples 
of theories. 

In Part mi we apply the techniques developed in Part Q] to infer the results 
of this thesis. As we mentioned in Section 11.21 our results split into two parts. 
Both parts particularly address to poly topic theories, i.e. theories where the set 
of states is a polytope. In Section [4j we show that every polytopic theory that 
satisfies repeatability, a subspace principle and a state discrimination principle 
is a classical theory. Section [5] is dedicated to the main result of this thesis. 
It infers classical theory from polytopic theory from only one simple postulate. 
This postulate states that every measurement for which the outcome can be 
predicted with certainty does not alter the state. 

We will conclude this thesis by some remarks and an outlook on possible 
generalizations in Section [5] 



Part I 

Introduction to the framework 
and the derivation of the 
techniques 

2 Convex sets 

Instead of starting with an introduction to the framework of generalized proba- 
bilistic theories, we introduce convex sets first. This gives an advantage. When 
we introduce generalized probabilistic theories in Section [3J where the sets of 
states are given by convex sets, we can refer to a variety of examples that we 
introduce in this section. To avoid dealing with convex sets without a physical 
motivation in mind, we explain in Section 12.11 how convex sums naturally arise 
in the context of random processes and how they give rise to convex subsets of 
vector spaces. In Section 12.21 we discuss how the notion of convexity could be 
generalized and what makes convex subsets of vector spaces special in this more 
general context. Section [2731 is dedicated to the study of some aspects of convex 
sets in general as far as they are important for the present thesis. In Section 
12.41 we will focus on a particular class of convex sets called polytopes. In part 
HI1 of this thesis, we will consider generalized probabilistic theories whose sets of 
states are polytopes. 

2.1 Probabilistic mixtures and convex sums 

In generalized probabilistic theories, the set of states is commonly assumed to 
be a convex subset of a real vector spaceo As we will see in Section 12.31 this 
is a subset C of a vector space V such that for every two elements x and y of 
C, the line segment L x . y := {Xx + (1 — X)y | < A < 1} which connects x 
and y is contained in C as well. The goal of this subsection is to motivate this 
assumption. 

Suppose that we are given a physical system with initial state w. Assume 
that it undergoes a random process 1Z and that the state of the system after 
the random process depends on the outcome k of that random process. Subse- 
quently, we perform a measurement M on the system. To describe the statistics 
of the measurement outcomes, we need a description of the state of the system 
prior to the measurement, i.e. after the random process. However, it might be 
the case that we do not know the outcome k of the random process (on which 



2 Given that a quantum mechanical Hilbert space "H is a complex vector space, it might be 
confusing that the vector space is assumed to be real. However, the unit vectors of a quantum 
mechanical Hilbert space 7i only encompass the pure states. In full generality, the quantum 
state has to be treated as a density operator. The set of density operators S(H) is contained 
in the real vector space HermCH) of Hermitian operators on H (c.f. Example 1 2. 5 II . The reader 
unfamiliar with the concept of a density operator is referred to NCOOj . 



the state of the system depends). But if we know the statistics of the random 
process, i.e. the probabilities pk of the outcomes k of TZ, we can still make pre- 
dictions about the outcome of the subsequent measurement by describing the 
state of the system as the probabilistic mixture of the states {u>k}k- 

As an example, consider a quantum system with an initial state which is 
described by a density operator p E S(H). The random process 1Z prior to the 
measurement M might be a measurement as well. Say that this measurement 
is described by a projective POVM 1Z = {Pk}k- The probability of getting the 
outcome k is given by pk = tr (Pkp)- In this case (if the outcome is k), after the 
"^.-measurement, the system is in the state 

PkpPk 
Pk 



tr (P fc p) 



If the subsequent measurement M. is described by the POVM M = {Qi}i, 
the probability of getting the outcome /, conditioned on the outcome k of the 
TZ- measurement, is given by 



Pi\k = tr{Qip k ) = tr f Q v 



PkpPi 



tr{P k p), 

If we would not know the outcome of the random process, i.e. the outcome k 
of the 1Z- measurement, the probability that we would assign to the outcome I 
of the M -measurement would be the probabilistic mixture of the probabilities 
Pl\k, 

Pi = ^PkPi\k = ^2pktr(Qip k ) . (2.1) 

k k 

The linearity of equation (j2.ll) allows us to represent pi as 

Pi =^2pktr(QiPk) =tr I Qi I ^PkPk J j = tr(Qip), 

where 

P = Y^PkPk- (2-2) 

k 

The state p is the probabilistic mixture of the states {pk}k with probabilities 
{Pk}k- It is easily verified that every operator of the form (|2.2[) is a density 
operator. Any state which is a non-trivial mixture of other states is a mixed 
state. 

In a more general framework, we might assume that a physical theory de- 
scribes the set of states $7 of a system as a subset of some real vector space A 
(like the set S(H) of density operators on a Hilbert space H is a subset of the 
real vector space Herm(H) of Hcrmitian operators on TL, c.f. Examplc l2.5p . The 
probabilistic mixture of states {uik\k C VL with respect to probabilities {pk}k 
would then be given by 

ui = ^2p k uj k , where uj k E Q,, }^Pk = !■ (2.3) 

k k 



A linear combination of the form (|2.3I) is called a convex sum of the elements 
{iOk}k of CI. In order to treat the mixture w as a state, we need the consistency 
requirement that uj 6 Ci. Since we might think of random processes with any 
probabilities {pk}k which prepare any states {iVk}k, we require that any mixture 
of the form (|2.3[) is an element of CI. A subset CI of a vector space A which has 
this property is called a convex subset of the vector space. 

The requirement that convex sums of arbitrarily (but finitely) many elements 
of the set CI have to be contained in CI can be reduced to the requirement that 
the convex sum of only two elements has to be contained in CI. Clearly, if CI 
is a convex subset of A, then any convex sum of two elements of CI is again an 
element of CI. On the other hand, suppose that CI has the property that for any 
two elements of CI, any convex sum of the two elements is again an element of 
CI. Then, any convex sum of three elements is an element of CI as well: 

a,/3,7 € [0,1], a + ,5 + 7 = 1, u,a, veCl 

=>■ au> + (3a + 71/ = (1 — 7) u H a ) +7^ G CI . 

Vl-7 1-7 / 
s „ ' 

GO since T r^ + T 47 = l 

This argument extends to convex sums of arbitrarily many elements of CI. 
Therefore, we can characterize a convex subset C of a real vector space V 
by the property that for any two elements x and y of C, the line segment 
L x>y = {Air + (1 — X)y | < A < 1} which connects x and y is contained in C as 
well. We will give this property a geometric picture in Section 12.31 

2.2 More abstract notions of convexity 

It is very common to assume that probabilistic mixtures of states are given 
by convex sums. In a more general setting, however, it might be that the 
set of states is not a subset of a vector space, so that probabilistic mixtures 
cannot be expressed by linear combinations (a convex sum of the form (|2.3[) is 
a linear combination). It is interesting to examine how probabilistic mixtures 
could be generalized to this more general case. In other words, one might ask 
how restrictive it is to assume that probabilistic mixtures are given by convex 
sums. To this end, we recapitulate the ingredients that we put together to get a 
probabilistic mixture. We have a tuple (ujk)k of states (where all states belong to 
a common set of states CI) and a tuple (j)k)k of probabilities. These ingredients 
are combined to form a mixture ui of states (for the moment, we do not assume 
anything about how they are combined) . If we assume that the random process 
has n possible outcomes, we can regard this as an operation 

Sl"xA„ -> CI 

((wi,...,w„),(pi,...,j} n )) !->■ 2 



where 



A n :={( Pl ,..., Pn )e[0,lY 



E^ = 1 



As before in Section EOl with convex sums, if we do not consider any position in 
this operation as being distinguished from the others, we can regard this n-ary 
operation as emerging from a set of binary operations 

{cc\}xe[os] : ^ X fi -> 

which satisfy some compatibility requirements which allow for concatenating 
the binary operation in an associative way to get an n-ary operation. This idea 
is captured by the following definition. 

Definition 2.1 ( Fri09 ): A convex space is a set X equipped with a family 
{cca}ag[04] of maps 

cc x :XxX^X, 
which is called the convex combination, satisfying the following conditions: 

• cco(x,y) = y Vx, y € X , (2.4) 

• cc x {x,x) =x Vx e X, VA e [0, 1] , (2.5) 

• cc x (x,y)=cc 1 - x (y,x) Vx, y e X, VA e [0, 1] , (2.6) 

• ccx{cc fi (x,y),z)=cc^(x,cc fi (y,z)) Vx,y, z 6 X,V\,p e [0, 1] (2.7) 



with 



( A(1-m) 

A = Am , /i = < 1_ ^ 

I arbitrary, if A = /i = 1 . 



'/Am^ 1, 



It is not hard to convince oneself of the fact that if X is a convex subset of 
a real vector space, then the binary convex sum 

ccx{x,y) = \x+(l-\)y (2.8) 



satisfies the properties l|2.4[) - (12.71) and is therefore a convex space in the sense of 
Definition l2.il However, it turns out that there are convex spaces that cannot be 
realized as a convex subset of a vector space. Within a physical interpretation, 
they have, in a certain sense, a possibilistic rather than a probabilistic structure. 
They do not provide a quantitative measure for how likely it is that an event 
occurs but only give a qualitative "yes or no" -structure which says whether or 
not an event is possible. For example, the two-element set {i, /} together with 
the operation 

c n U ifA = 

CCxihf) = \i ifA^O 



satisfies all the axioms (|2.4p - (|2.7[1 for a convex space. This example looks 
pathological, but it arises as a special case of a more natural class of spaces of a 
combinatorial or possibilistic type. We will not discuss such spaces here, since 
we are interested in theories that give us quantitative predictions about the 
probability of events and are therefore of a probabilistic nature. For examples 
and a detailed discussion of possibilistic spaces, we refer to |Fri09j . 

Instead, we want to attend to the question of how convex subsets of real 
vector spaces can be distinguished from other types of convex spaces. This 
question has a mathematically clear answer provided by the following theorem. 

Theorem 2.2 (Stonqfl |Sto49], see |CFllj ): A convex space embeds into a real 
vector space with (|2.8I) if and only if the following cancellation property holds: 

cc\(x, y) = cc\{x, z) with A € (0, 1) =>■ y = z. (2-9) 

This gives us an explicit criterion which separates convex subsets of vector 
spaces from other convex spaces. Applied to state spaces of generalized prob- 
abilistic theories, it seems that there is no immediate physical interpretation 
of the cancellation property (|2.9[) . Nonetheless, it is good to be aware of the 
fact that from a very abstract point of view, the assumption that convexity is 
represented by convex subsets of vector spaces causes an (arguably small) loss 
of generality. 

In the following, we will always assume that the set of states in a physical 
theory is a convex subset of a real vector space. We will not refer to the more 
general notion of convexity of Definition 12.11 anymore. Whenever we will talk 
about convexity, we refer to convex subsets of real vector spaces, which we will 
often simply call convex sets. Therefore, when we say convex combination, we 
mean a convex sum. 

2.3 Convex subsets of vector spaces 

In this section, we introduce some general aspects of convex subsets of vector 
spaces. The presentation of convex sets that we give here is not to be understood 
as a standard introduction to the field. Instead, we discuss some aspects and 
prove some properties of convex sets which are important for our particular 
purpose. 

Definition 2.3: A subset C of a real vector space V is a convex subset or 
convex set if x, y E C implies Xx + (1 — X)y € C for all < A < 1. 

The property of being a convex set has a very simple geometric interpretation 
(see Figure l2~Tj) : For any two points x,y E C C V, the line segment 

L Xj y := {A.t + (1 - X)y | < A < 1} 



3 This theorem is originally by Stone |Sto49| . The version presented here is a modified, 
more modern version by Capraro and Fritz |CF11| . 
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Figure 2.1: The triangle, the square and the circular disk are convex, but the 
circular section depicted on the right is not. 

connecting the two points is contained in C as well. By what we have discussed 
at the end of Section 12.11 this also implies that any convex sum of elements in 
C is contained in C (and not just binary convex sums). It is easy to see that 
the intersection of two convex sets is convex. 

One of the very central notions in the study of convex sets is the notion of 
extreme points. The extreme points of a convex set C are the elements which 
cannot be represented as a non-trivial convex combination of other elements of 
C. 

Definition 2.4: A point e of a convex set C is an extreme point of C if 

e = ax + (1 — a)y with x, y E C and < a < 1 implies x = y = e. The set of 
extreme points of a convex set C is denoted by ext(C). 

For instance, the extreme points of a closed triangle (square) are the three 
(four) corners, whereas every point on the boundary of the closed circular disk 
is an extreme point. In contrast, an open ball or an open half-space in a vector 
space are both convex sets, but neither of them has any extreme point at all. 
We will see below (Theorem 12 .14j) that this is related to the fact that those are 
non-compact sets. 

Let us consider a more abstract and less trivial example of a convex set. 

Example 2.5 (The set of density operators): For a finite-dimensional Hilbert 
space H, the set S(TL) of density operators on 7i (the set of states) is defined 
by 

S(H) := {p E Herm(-H) | p > 0,trp = 1}, where 
p> :«*■ (ip\p\ip) > v|v>) e n. 

A Hcrmitian operator p with p > is called a positive operator. Thus, a density 
operator is a positive operator with unit trace. The set S(7i) is convex: 

< a < 1, p } TES(H) 
=> (ip\ap+ (1 - a)r|V>) = a{4>\p\ip) + (1 - a)(V>|r|V>) > 0, 
tr(ap + (1 - o)t) = a tr{p) + (1 - a) tr(r) = 1 
=^ap + (l-a)r e S(H) . 

The extreme points of 5(H) are the pure states, i.e. the density operators of 
the form p = \?p){ifj\ for some \ip) G %. In the case where H. is a two-dimensional 



10 



Hilbert space, the set S(H) can be visualized by the so called Block sphere. The 
name Bloch "sphere" is slightly misleading since the Bloch sphere is actually 
not a sphere but a unit ball. This is a neat visualization since it shows the 
convexity of S(T~L) in a geometric way. The boundary of the ball is given by the 
extreme points and therefore the pure states of the system. ■ 

Another very central notion in the study of convex sets is the notion of a 
face of a convex set. Roughly speaking, one might think of a face as some kind 
of convex "extreme subset" of a convex set. 

Definition 2.6: A nonempty convex subset F of a convex set C is called a face 
of C if ax + (1 — d)y G F with x,y G C and < a < 1 imply x,y G F . The set 
F is a proper face of C if F is a face of C and F ^ C ' . 

In other words, a face F of a convex set C is a face of C if every line segment 
in C with an interior poinl|f| in F is completely contained in F . By definition 
of an extreme point, if e G ext(C) is an extreme point of a convex set, then 
{e} is a face of C. The faces of a triangle are given by the triangle itself, its 
edges and corners, the proper faces of a cube are its six square sides, its edges 
and its corners. Note that the requirement that a face has to be convex makes 
a difference. If this requirement would be dropped, then any subset of ext(Af ) 
would be a face, but obviously, not every subset of ext(Af ) is convex (e.g. the 
union of two corners of a square is not convex). 

As we said above, when we compare the definition of a face with the definition 
of an extreme point, we could say that a face is some kind of convex "extreme 
subset" . It is not only an extreme set in terms of binary convex combinations 
but in terms of arbitrary convex combinations. We state this more formally. 

Proposition 2.7: Let F be a face of a convex set C, let iff. Let v\,...,v n 
be points in C such that there exists a convex combination of V\, . . . , v n with 
nonzero coefficients ax, . . . , a n which gives x, i.e. 

n n 

V* on 



iVi, for some on > with > on = 1 . 
Then v\, . . . ,v n G F. 



»=i i=i 



4 We say that an element z of a line segment L I|B is an interior point of the line segment 
if z = \x + (1 — \)y for some < A < 1. 
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Proof. We prove that for any j £ {1, . . . ,n}, we have that Vj 6 F. 



^T, 



y^ onvi = ctjVj + y pij'c, 







= ctjVj + av — x £ F with a,-, a > 0, Oj + a = 1. (2-10) 

The vector u is a convex combination of elements in C, so it is itself an element 
of G. Thus, by the definition of a face, (12.10)) implies that Vj £ F. □ 

Note that Proposition 12 . 71 in particular applies to the case where the face F 
consists of an extreme point, i.e. F = {e} for some e £ ext(C). Another simple 
but very useful property of faces is the following. 

Proposition 2.8: For a convex set C, a face of a face of G is itself a face of G. 

Proof. Let F be a face of G, let G be a face of F. Let x,y £ C and < a < 1 
such that ax + (1 — a)y £ G. Then qi + (1 — a)y £ F, so x, y £ F since F is a 
face of C. This means that we have x,y £ F, < a < 1 with ax + (1 — a)y £ G. 
The set G is a face of F, so this implies that x, y £ G. We have proved that 
x,y £ C and < a < 1 such that ax + (1 — a)y 6 G implies x,y £ G, so we 
have proved that G is a face of G. □ 

In the following, we will show a useful and intuitive property of faces of 
convex sets. Before we can state it, we have to introduce the affine hull of a 
set. 

Definition 2.9: Let M be a subset of a real vector space V . The affine hull 
of M, denoted by aff(Af), is defined by 



aff(M) := <^ Y a t v t 



£ {0, 1,2,.. .}, v % £ M, a,eM, Y, a, = 1 



A few examples: The affine hull of a point is the point itself, the affine hull 
of two points is given by the straight line through the two points, and the affine 
hull of a triangle, square or circle disk is the plane which contains it. With the 
definition of the affine hull at hand, the property we want to prove reads as 
follows. 

Proposition 2.10: If F is a face of a convex set G, then F = aff(F) n G. 

Proof. The inclusion F c aff (F) n G is obvious. For the other inclusion, let v 
be an element of aff (F) D C. Our goal is to show that v £ F. There is an affine 
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combination of finitely many elements of F which gives v: 

v = y, a i v i + /_, a j v j > where 
iei+ jei- 

oti > o Vi e i + , a.,- < o Vj e 7~ , 
J^ ^ + 5^ a,- = 1 . 



(2.11) 





ieH 


je-f- 






define 












«+:= 


E a i^ 
E «» 


i> 


E <W 
_ je/- 

E «j 
iez- 



This gives 



W/+ / ye*- / 



(2.12) 



The set F is convex, so we have that both v + and v~ are elements of F since 
they are given by convex combinations of elements of F. If I~ is empty, then 
by Equation (|2.12|) one has that v = v + and therefore v e F (which is what we 
want to show) . If I~ is non-empty, then 

1 



^a l >\ 



0< 



iei+ 



E a i 



0< 1- 



< 1 and 



< 1 



E <*! 

!6/+ 



(2.13) 
(2.14) 



Note that 



E on 

i£l+ 



£ oa - 1 
E «« 



(2.15) 



Inequalities (|2.13[> and f|2.14f) imply that the following is an interior point of the 
line segment L vv - from v £ C to v~ G F C C: 



1 



1 



E "i 



E «* 



I2J21 
J2~15l 



E ■ 



*j"j 



' E on 
\iei+ , 



E °Li 



E "i 



E 



^"j 



E 

jei- 



(2.16) 
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From (|2.11[) . we get that 







This allows us to reformulate the right-hand side of (|2.16p : 

E «i«j E «i E "i^j 

1 . .. 1 I _ + iei- jei- jei- 

1 1 == \ v = v ' 



E «* 1 E a i I E <^ E a * E a j 

ie/+ \ iei+ J ie.i+ je/+ is/ - 

= v + e F . 

The set F is a face of C, so by the definition of a face, v E F. The vector v 
is an arbitrary element of aff (F) n C, so we have shown that aff (F) f) C C F, 
which completes the proof. □ 

Next, we want to turn to a very central result in the study of convex sets. It 
states that a compact convex set is the convex hull of its extreme points. This 
needs some preparation. At first, we need to know what the convex hull is. For a 
subset M of a real vector space V, the convex hull of M can be characterized as 
the smallest convex subset of V which contains M. This set can be obtained by 
taking all convex combinations of points in M. The following definition states 
this more formally. 

Definition 2.11: For a subset M of a real vector space V, the convex hull of 
M , denoted by conv(M), is defined by 






conv(M) ■= \ / onv 



ne {0,1,2,...}, Vi G M, q, G [0,1], ^q, = 1 



The definition of the convex hull reads similar to the definition of the affinc 
hull (c.f. Definition I2.9J) . The only difference is that the coefficients in the sum 
are positive (instead of just real). Note that conv(M) C aff(Af) is true for any 
set M. 

Another thing we have to understand is what it means for a subset of a 
vector space to be compact. Compactness is a topological property, and so far, 
we have not defined a topology. In finite-dimensional vector spaces, however, 
there is a canonical topology, as we will see below. This is very practical since we 
will restrict ourselves to the finite-dimensional case. Readers who are interested 
in the more general, infinite-dimensional case are referred to Appendix [XJ We 
only state the following definition and theorem to show that we can refer to 
basic topological notions without explicitly defining a topology (we will refer to 
compactness and closedness of sets). We will not refer the notions of topological 
vector spaces or Hausdorff spaces again, so the reader unfamiliar with these 
concepts will not have any problems while reading this thesis. 



14 



Definition 2.12: A real topological vector space is a real vector space V 
equipped with a topology such that the vector addition V x V — > V and the scalar 
multiplication M X V — > V are continuous. 

Theorem 2.13 (see [SW99, Chapter 3]): For a finite-dimensional real vector 
space V, there is a unique Hausdorff topology on V with respect to which V is 
a real topological vector space. 

This means that in the case of a finite-dimensional vector space V, we can 
refer to topological properties of subsets of V without explicitly specifying a 
topology on V. Now we are ready for the theorem. 

Theorem 2.14 (Minkowski, see [Web941 Theorem 2.6.160): Let V be a finite- 
dimensional vector space and let C be a compact convex subset of V. Then C 
is the convex hull of its extreme points: 

C = conv(ext(C)) . 

In particular, C has extreme points. 

The reader who wants to see how this theorem can be generalized to the 
infinite-dimensional case is referred to Appendix |A"1 Here, we restrict ourselves 
to the finite-dimensional case. Theorem 12 . 141 states that a closed convex set is 
fully specified by its extreme points. When we apply this to a set of states fl 
(which we will assume to be a compact convex subset of a finite-dimensional 
vector space), this gives us a physical interpretation (which we will discuss in 
Section I3.5|) . The extreme points of fl will be called pure states, and they 
correspond to maximal knowledge about the system. According to Theorem 
12.141 all states of incomplete knowledge (i.e. the states which are not extreme) 
can be represented as a probabilistic mixture of states of maximal knowledge. 

The next thing we want to learn is that a closed convex subset C of Hilbert 
spaces H allows for a distance function d( ■ , C) : % — > R. To define this 
function, we use the famous Hilbert Projection Theorem. 

Theorem 2.15 (Hilbert Projection Theorem, see |Wer071 Satz V.3.2]): Let U 

be a Hilbert space, C C H closed and convex and x G H . Then there is a 

unique xq G C such that \\xq — x\\ = inf \\y — x\\. In this case, we define 

yec 

d{x, C) := inf \\y — x\\. 
yec 

2.3.1 Convexity-preserving maps 

To conclude the introduction to general convex sets, we want to turn to the 
question when two convex sets are equivalent. The structure in question is the 



5 In [Web94l Theorem 2.6.16], this theorem is referred to as the Krein-Milman Theorem, 
which is not correct since it has been proved by Minkowski. As described in Appendix fX] 
the Krein-Milman Theorem is a statement about a more general case in infinite-dimensional 
vector spaces. 
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convexity structure of the two sets. To investigate whether two sets show the 
same convexity structure, it is convenient to introduce a map which conserves 
this structure. We call such a map a convex-linear map. As we will see below, 
this is the same as an afhne map. This kind of map will be important in the proof 
of our main result in Section 15.21 For the reader interested in the uniqueness 
of abstract state spaces (we will come back to this issue in Section 13. 2\i , afhne 
maps play a central in the proof of the equivalence of compact convex sets and 
abstract state spaces presented in Appendix [B] 

We start with the definition of a convex-linear map. Simply speaking, this 
is a map which commutes with the action of taking convex combinations, so it 
preserves the convexity-structure. In formal terms, this reads as follows. 

Definition 2.16: A map f : V —¥ W between finite- dimensional real vector 
spaces V and W is convex-linear if 

f(Xx + (1 - X)y) = Xf(x) + (1 - X)f(y) \/x, y E ^, VA E [0, 1] . 

Two convex subsets S C V and T C W are convex-isomorphic if there is a 
bijective map f : S — >• T which extends to a convex-linear map f : V — > W. 

Convex-linearity exactly represents our intuition for the "conservation of 
the convexity-structure" . However, it turns out that we could have defined 
the property that a map "conserves the convexity-structure" in a (seemingly) 
stronger way without loss of generality, as we see in the following. 

Definition 2.17: A map f : V — » W between finite- dimensional real vector 
spaces V and W is affine if 

f(Xx + (1 - X)y) = Xf(x) + (1 - A)/(y) Vs, y E V, VA e R . 

The difference to Definition l|2.16|l is that the scalar A can be any real number 
instead of only an element of [0, 1]. 

Proposition 2.18: Every convex-linear map is affine. 

Proof. Let / : V — > W be a convex-linear map, let x, y E V and let A E EL If 
A E [0,1], then 

f(\x + (1 - X)y) = \f(x) + (1 - X)f(y) . 

If A ^ [0, 1] we can assume without loss of generality that A < 1 (in the other 
case where A > 1, we can simply interchange the role of x and y). We can write 
y as the following convex combination: 

y = -±-(Xx + (l-X)y) + (l 



1-X 
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This allows us to write 

M = / (j^x {Xx + (1 _ x)y) + ( l ~ Y^x ' ' 

The map / is convex-linear, so 

f(y) = y^x K Xx + C 1 " A ^ + f 1 - I3a ' /( ■' ' 



«• Y^/(A^ + (i - X)y) = f(y) + j^m 

«. /(As + (1 - A)y) = A/(x) + (1 - A)/(y) . (2.17) 

We have proved that Equation (|2.17[) holds for all £, y £ V and for all A G E, 
so / is afhne. D 

For practical purposes, as well as for the intuition for convex-linear maps, 
it is useful to see that a convex-linear map can always be represented by the 
action of a linear map followed by a translation. This is the statement of the 
following theorem. 

Theorem 2.19 ( |Web941 Theorem 1.5.2]): A map / : V -> W between finite- 
dimensional real vector spaces is affine (by Proposition ^. 18l we can equivalently 
say convex-linear) if and only if it is of the form 

f(x) = L(x) + y for some y £ W and for some linear map L : V —> W . 

Now we show two propositions which will be helpful for the proof of the 
main result in Section [ 



Proposition 2.20: Let / : A — )• B be an affine or convex-linear map. Then / 
commutes with arbitrary affine combinations. More precisely, for any x\ , . . . , x n G 
A, on has that 

(n \ n ri 

2^o.iXi = / jOtjfixj) for any real numbers (Xi with } ttj = 1 . 
i=\ ) i=\ i=\ 

In particular, /(aff (M)) = aff(/(M)) (and since every convex combination is 
an affine combination, we also have /(conv(M)) = conv(/(M))) for any subset 
M of A. 

Proof. Let / : A — > B be affine, i.e. 

f(Xx + (1 - \)y) = Xf(x) + (1 - A)/(y) Wx, y G A, VA e K . (2.18) 
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Let Ei=i a i x i be any affine combination of elements x\, . . . ,X„ E A. Then 



/ ^2 a * x * = f 



a\X\ 



£• 



V 



E 



IjUtj 



i=2 



I2T81 



«i/(*i) + (f> j / E 



vi=2 



v^ 2 IS* 



/ 



nf(xi)+ E a * M 



ai/(a;i)+ E a < M 



a 2 



vvS a< 



j=3 £ a, 



i=2 



si=2 



a 2 



E a < 

( = 2 



•i'2 



E a fc 

fe=3 



E «< 

).=2 



ism ., ,, Nr 



Q 2 



vi=2 



E a J x 3 
/ n 

^= 3 E a k 

Vfc=3 
/(^) 



E a > 

i=1 



E «fe 

fc=3 



E a - 

i=2 



/ / / n 



•?= 3 ( E a k 
,fc=3 



ai/(xi) + 02/(3:2) + . . . + a n f(x n ) ■ 



D 



Proposition 2.21: Let / : A — > B be an affine map. Then the following 
statements are equivalent: 

(a) The map / is injective. 

(b) The map / maps affinely independent points to affinely independent points|j 

Proof. We prove the two implications separately. 



6 As we will see in Definition 12.251 a subset of a vector space is affinely independent if no 
element of the subset lies in the affine hull of the other elements of the subset. 
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(a) =>■ (b): Let / : A — > B be an injective affine map. We prove the 
contraposition: Let xi, . . . ,x n £ A such that f(xi), . . . , f(x n ) £ B are 
affinely dependent. This means that there exists a k £ {l,...,n} such 
that 

n n 

/ p/-if{xj) = f{xk) for some real numbers cti with 2_^ a ' b = ^ ■ (2-19) 

i=l i=l 

The map / is affine, so we can rewrite Equation (J2.19I) : 

( , \ 



f 



E 



OiiXi 



\}jtk ) 



f(x k ). (2.20) 



We have assumed that / is injective, so Equation (|2.20[) implies that 

n n 

/ Q-i<Ej — Xk , 7 Oij 1 , 
i=\ i=\ 

so Xi, ■ . . ,x n € A are affinely dependent. 

• (b) =>• (a): Assume that / : A — >• B is an affine map that maps affinely 
independent points to affinely independent points. Let x, y £ A be such 
that f(x) — f(y). This means that f(x) and f(y) are affinely dependent 
(since each side of the equation f(x) = f{y) can be seen as the trivial affine 
combination of one element). By the assumption that / maps affinely 
independent points to affinely independent points, this means that x and 
y must be affinely dependent. The only affine combination is x = y, so / 
is injective. □ 

2.4 Polytopes 

Now we investigate a special class of convex sets which are called polytopes. As 
for the previous subsection, this is not a standard introduction to polytopes. 
Instead, this subsection is aimed at understanding and proving some particular 
properties of polytopes which will be important in the sections of Part [II] of this 
thesis. 

Definition 2.22: A compact convex subset P of a finite- dimensional real vector 
space is a polytope if ext(P) is a finite set. For a polytope P , an element of 
ext(P) is called a vertex of P (pi.: vertices). 

Note that the requirement that the set P has to be compact makes a big 
difference: The nonnegative numbers in R or an interval of the form [a, b) in R 
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are both examples with finitely many (namely one) extreme points, but neither 
of them is a polytope since they are not compact. 

It is assumed in the definition of a polytope that the vector space containing 
the polytope is finite-dimensional. We show in Appendix [X] that this does not 
cause any loss of generality. 

A polytope can equivalently be characterized as the convex hull of finitely 
many points. To see this, we make use of the following result of Caratheodory. 

Theorem 2.23 (Caratheodory [CarQ7j . see [Gru671 Chapter 2.3]): If C is a 
compact subset of a finite-dimensional real vector space, then conv(C) is closed. 
In other words, for compact C we have conv(C) = conv(C). 

Proposition 2.24: A subset P of a real vector space is a polytope if and only 
if it is the convex hull of finitely many points. 

Proof. A polytope P is by definition a compact subset of a finite-dimensional 
vector space. This allows us to apply Theorem 12.141 which implies that P = 
conv(ext(P)), where ext(P) is a finite set by the definition of a polytope. For 
the other direction, let M be a finite set. Then M is trivially compact, which 
by Theorem 12.231 implies that conv(M) is closed. M is finite, so conv(M) 
is also bounded. In finite-dimensional spaces, being closed and bounded is 
equivalent to being compact, so conv(M) is compact. The set conv(M)\M does 
not contain any extreme points of conv(M) (as one can see from the definition of 
an extreme point), so ext(conv(M)) C M which is a finite set. Thus, conv(M) 
is a polytope. □ 

Next, we want to introduce the notion of the dimension of a polytope. It is 
defined as the dimension of the affine hull. 

Definition 2.25: We define the dimension dim(aff(M)) of an affine hull 

aff(Ajf) of a subset M of a finite- dimensional vector space V as n + 1, where n 
is the maximal cardinality of a subset of aff(Af) such that the subset is affinely 
independent. A subset of a vector space is affinely independent if no element 
of the subset lies in the affine hull of the other elements of the subset. 

With this definition at hand, we can characterize polytopes by the dimension 
of their affine hull. A few examples are shown in Figure 12.21 

Definition 2.26: The dimension d of a polytope is the dimension of its 
affine hull, d :— dim(aff(P)). The dimension of the empty polytope {} is defined 
to be — 1. A d-dimensional polytope is called a d-polytope. A 2-polytope is a 
polygon, a 3-polytope is a polyhedron. 

Polytopes can be given an intuitive geometrical picture: they are the inter- 
section of finitely many closed half-spaces which are positioned in a way such 
that their intersection is bounded. In more technical terms, this reads as follows. 
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Figure 2.2: From left to right, we have a 0-polytope (which is nothing but a 
point), a 1-polytope (in other words, a line), two 2-polytopes (polygons) and 
two 3-polytopes (polyhedra). 

Definition 2.27: A subset P of a finite- dimensional real vector space V is 
called a polyhedral set provided that P is the intersection of a finite family of 
closed half- spaces in V . 



Theorem 2.28 ( Gru67, Chapter 3.1]): A subset P of a finite-dimensional real 
vector space V is a polytope if and only if P is a bounded polyhedral set. 

Now we are ready to give a good picture for the faces of a polytope. It turns 
out that a subset F of a polytope P is a proper face of P if and only if it is the 
intersection of P with an affine hyperplanqll which touches P but which does 
not cut P. This result is established by the following two propositions. 

Proposition 2.29: If F is a face of a compact convex subset P of a finite- 
dimensional vector space V, then F = conv({v £ ext(P) | v £ F}). 

Proof. Obviously, conv({u £ ext(P) | v £ F} C F since F is a convex set. 
For the other inclusion, let w £ F. By Theorem I2.14[ one has that P = 
conv(ext(P)). Let 



w 



^2 am, 

i=l 



a* > 0, Vi £ ext(P) V* £ {1, . . . , n} 



(2.21) 



be any convex combination of extreme points of P with nonzero coefficients 
which gives w. By Proposition 12.71 vi,...,v n £ F. Hence, for every w in F, 
it holds that every convex combination of extreme points of P which yields w 
is a convex combination of extreme points that are elements of F. This proves 
F C conv({v £ ext(P) | v £ F}). □ 

Proposition 2.30: For a non-empty convex subset F of a polytope P, the 
following are equivalent: 

(a) F is a proper face of P. 

(b) There is a closed half-spacqj H containing P such that F = P n dH, where 
dH is the affine hyperplane defined by the boundary of H . 

7 We say that a subset M of a real vector space V is an affine hyperplane if there is a 
nonzero linear functional / € V* and at£l such that M = {v £ V \ f(v) = k}. 

8 We say that a subset H of a real vector space V is a closed half-space if there is a nonzero 
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Proof. We prove the implications (a) =>■ (b) and (b) => (a) separately. 

• (a) =* (b): 

Let if = conv({e G ext(P) | e ^ P})- We prove (a) =4> (b) in three steps: 

(i) At first, wc show that FC\K = $. 

(ii) Then we show that P n if = implies the existence of a linear 
functional / which takes a constant value k on F and satisfies f(y) < 
k for all v G K . 

(iii) Finally, we show that H = {v G V \ f(v) < k} has the desired 
properties of (b). 

Before we prove the three steps, we make a few definitions. P is a polytope, 
so it has finitely many, say n, extreme points. We define {ei, . . . , e n } to 
be the extreme points of P, i.e. 

cxt(P) = {ei,...,e n }. (2.22) 

Moreover, we define two index sets If, Ik by 

I F = {i€{l,...,n}\ei€F}, (2.23) 

lK = {j€{l,...,n}\e j <£F}. (2.24) 

Now we prove each of the three steps. 

(i) Recall that we have defined K = conv({e G ext(P) | e ^ P}). By 
Equation (|2.24p . we have K = conv({ei}i e i K ). Suppose that there is 
an G KC\F . The vector x is in P, so there is a convex combination 

x= J2 a i e 3' a i - °' Yl a J= l - 

The vector x is an element of the face F, so Proposition 12.71 implies 
that ej G F for all j with a>j > 0. This leads to a contradiction since 
we have assumed that ej ^ F for all j € Ik- This means that there 
cannot be an element x £ F P\ K, i.e. F n K = 0. 

(ii) F is a proper face, so Proposition 12.291 implies that K is nonempty. 
By Proposition 12.241 if is a polytope since ext(P) is finite, so K is 
compact and convex. The fact that K — P n K implies 

aff(P) n k = aff (P) n (P n k) = (aff(P) n P) n K . (2.25) 

We know from Proposition 12.101 that aff(P) n P = P, so 

(aff (P) n P) n k = f n k ( = } , (2.26) 



linear functional / € V* and a fc £ R such that H = {v 6 1/ /(f) < fe}. The boundary of 
the half-space is given by dH = {u 6 V | /(f) = fe}. 
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where the last equality has been shown in the first step of the proof. 
Equations (TT25)) and ([2~26| imply that aff(F) n K = 0. 
All in all, we have that K is nonempty, compact and convex, aff (F) 
is closed and convex, and aff(F) n K = 0. This allows us to apply 
the separating hyperplane theorem. It says that there is a linear 
functional / with 

sup{/0) | v G K} < inf{/0) | v G aff(F)}. (2.27) 

aS(F) is an afhne hull on which the linear functional / is lower 
bounded by sup{/(u) | v G K}. This implies that / is constant 
on aff(-F): If there were y,z € aff(F) with f(y) < f(z), then by 
choosing a large enough scalar a we would have that 

f(ay + (1 - a)z) = f(z) + a(f(y) - f{z)) < max{j» v G K} . 

S v ' S v ' 

Gaff(_F) <0 

Let k :— f(v) for some v G &ff(F) be the constant value that / takes 
on aff(T). From (ET2T)) it follows that f(v) < k for all v G K. 

(iii) Let V denote the vector space containing the polytope P, let H := 
{v eV \ f(v) < k}. Recall from (f2T22|) that ext(P) = {ei, . . . , e„}. 
Let a be an arbitrary element of P. Then 

n n 

a = \ onet for some numbers ai > 0, > a\. = 1. (2.28) 

i=l i=l 

From the Definitions (|2~2^| and (12^2^)) . we see that J F U//c = {1, . . . , n}. 
With these definitions, (|2.28p reads 






a % ei + > a 9 e, 



and therefore, by what we have shown in step (ii), we obtain 
f(a) = Yl a ' 1 -^ e *) + Yl a i ^ e ^ - k ' 

with equality if and only if a G F (since X)"=i a i ~ •"•)• This proves 
P C H and F = P n dH, where dH = {v G V \ f(v) = k}. 

• (b) ^> (a): Let the vector space containing P be denoted by V. Let H 
and 9if be sets of the form H — {v G V | f(v) < k] and dH = {v G V | 
/(w) = /c} for some linear functional / and some k G K. Let F — Pn dH. 
Suppose there are i,y£P, 0<a<l such that ax + (1 — a)y G F . Then 

f(ax + (1 - a)y) = a f{x) +(1 - a) f(y) = k 

<k <k 

=>• f(x) = f(y) = k ^x, y eF. 

Thus, F is a proper face of P. □ 
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Note that the equivalence stated in Proposition 12.301 does not hold in the 
more general case of convex sets. In R 2 , let C be the union of lower open half- 
space and the non- negative a;-axis (which is a convex set), let F consist of the 
origin. Then F is a face, but F is not of the form F = C dH for some closed 
half-space H. Proposition 12.301 has the following immediate consequence. 

Corollary 2.31: A face F of a polytope P is a polytope. 

Proof. The improper face F = P is by assumption a polytope. If F is a proper 
face, then it is the intersection of P with an afhne hyperplane dH. P is compact 
and dH is closed, so F is compact. Proposition l2.29l implies that ext(F) is finite, 
so F is a polytope. □ 

Since we have defined the dimension of a polytope, Corollary 12.311 suggests 
the definition of the dimension of a face. 

Definition 2.32: The dimension of a face F of a polytope is the dimension 
of F as a polytope. We say that a face of dimension d is a d-face. If F is a 
(d — l)-face of a d-polytope, then F is called a facet of P. 

Now we have enough technical background to consider a very central class 
of polytopes which are called simplices. They will be very important when we 
deal with generalized probabilistic theories. The theories with simplices as sets 
of states are precisely the classical theories, as we will explain in Example 13.291 

Example 2.33 (Simplices): A <i-simplex is the convex hull of d + 1 affinely 
independent points. 





Figure 2.3: From left to right, we have a 0-simplex (a point), a 1-simplex (a line), 
a 2-simplex (a triangle) and a 3-simplex (which is also called a tetrahedron). 

For every d G N, there is precisely one type of d-simplexO From Proposition 
12.291 we see that a face of a simplex is again a simplex since any subset of an 
affinely independent set of points is affinely independent. Given a d-simplex, 
we can easily construct a (d + l)-simplex. We simply have to add an affinely 
independent point and take the convex hull of this point and the d-simplcx. For 
example, a tetrahedron can be constructed from a triangle by adding a point 



9 Two d-simplices can be bijectively mapped to each other by an afhne map. By what we 
have discussed at the end of Section 12.31 this means that any two d-simplices have the same 
convexity-structure. 
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which is affinely independent of the triangle and taking the convex hull of the 
point and the triangle. I 

A characterizing property of simplices is the uniqueness of the convex com- 
bination of extreme points which gives an element of the simplex. 

Proposition 2.34: For a d-polytopc P, the following are equivalent: 

(a) P is a simplex. 

(b) Every element x G P is a unique convex combination of extreme points of 
P. 

Proof. We prove the two directions separately. 



• (a) —> (b): Let P be a d-simplex with extreme points ext(P) = {ei, . . . , ed+i} 
and let 

d+l d+l 

x = ^a i e i = ^p i e i (2.29) 

i=l i=l 

be two convex combinations of extreme points which yield x. Suppose 
that for any k £ {1, . . . , d + 1}, we have that at ^ Pk- Then, from (|2.29[) . 
we can construct an afhne combination 



d+l R 

V^ Pi - 

i=i 

i=£k 
d+l Q 

sr^Pi — OLi 

i^k 


e,- with 

-Pk 

(l-Afc)-(l- 


- ak) 


ak - Pk 





= 1. 



But this is impossible since P is assumed to be a simplex, for which (by the 
definition of a simplex) ext(P) = {e±, . . . , ed+i} is an affinely independent 
set. Therefore, a^ = /3j. for every k G {1, . . . , d+ 1} since k was arbitrary, 
so the convex combination of extreme points which gives x is unique. 

• (b) —> (a): We have to show that |ext(P)| = d+l and that ext(P) is 
affinely independent. Say that ext(P) = {ei, . . . , e„}. Suppose there is a 
k G {1, . . . , n} such that there exists an affine combination 



^2a t ei = e k . (2.30) 



(=i 
i^k 
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From this we can construct convex combinations: 

l£ := {i £ {1, . . • , n} | i ^ k, a* > 0} , 

1^ '•= {i € {1, . . . ,n} | i ± k, an < 0} 

=> 22 aiei = &k ~ z2 a i e i 

/ \ / \ / 

1 



*6^ 












\ 



V ^i J 



(2.31) 



It is easily checked that both sides of equation (|2.31[) are convex combina- 
tions. They obviously differ since the convex combination on the left side 
does not contain ek whereas the one on the right side does. This contra- 
dicts the assumption that every point in P is a unique convex combinations 
of elements of ext(P). Therefore, there cannot be a fc £ {1, ...,n} such 
that (|2.30p holds which proves that ext(P) is an affinely independent set. 
The set P is a d-polytope, so ext(P) must contain precisely d+1 affinely 
independent points. □ 

Above, we have developed a half-space- and hyperplane-picture for polytopes 
and faces. Now, we prove a property of polytopes which will in turn allow us 
to prove a very important proposition for abstract state spaces in Section [3J 

Proposition 2.35: Let P be a polytope. Assume that P can be represented as 
the intersection of a given finite set of closed half-spaces, P = f] ieI Hi. Let F 
be a face of P. Then there is an I £ I such that F is contained in the hyperplane 
defined by Hi, i.e. F C dHi. 

Proof. Let the vector space V containing P be equipped with any inner product, 
turning V into a Hilbert space. An affine hyperplane is a closed and convex 
subset of V, so according to the Hilbert Projection Theorem I2.15[ 



d(-,Hi): V 



x M> mm \\x — y\ 



is well-defined for every i £ I. By Corollary 12.311 F is a polytope, so ext(P) is 
a finite set. Let n := | ext(P)|, say ext(P) = {ei, . . . , e„}. Let 



" 1 
ZZ-e^F. 

z — J n 



).= 1 



The index set / is finite, so D x := vam. ie j d(x, dHi) exists. Let w £ V be a unit 
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vector normal to Jl 10 l Consider the line segment 

L := {X(x + D x w) + (1 - X)(x - D x w) | A E [0, 1]} . 

The face F has to be contained in every half-space Hi (otherwise F would not 
be contained in P = f] ieI Hi). Moreover, we have that 

d(x,x + D x w) = ||ar — (x + D x w)\\ = D x \\w\\ = D x = xma.d{x, dHi) 

<d{x,dHi) for alii el. 

In words, we have just shown that for every half-space Hi, it holds that 

• x is in Hi and 

• x + D x w is closer to x than dHi is to x. 

This implies that (x + D x w) € Hi for every i e I and therefore (x + D x w) E 
P = C\iei H i- Analogously, (x - D x w) E P. 

The element of the line segment L corresponding to A = i is contained in F. 
The set F is a face of P, so by the definition of a face, the whole line segment L 
is in F. The vector w is normal to F, so L can only be contained in F if D x = 0. 
By the definition of D x , this implies that there is a hyperplane Hi 6 {Hi}i e j 
whose boundary dHi contains x. The fact that x € dHi means that there is a 
linear functional / € V* and atgl such that Hi = {v G V \ f(v) < k} and 
f(x) = k. It holds that F C P C Hi, so f(ej) < k for all j E {1, . . . , n}. Recall 
that {ei, . . . , e„} := cxt(F). 

n 

k = f{x) = Y J -f^j) => f(e j ) = k Vje{l,...,n} 

=> ej E dHi Vej E ext(F), dH t convex =J> F C H i . D 

Note that the statement of Proposition l2.35l would be false if the set of closed 
half-spaces would not be assumed to be finite. This is shown in the following 
example. 

Example 2.36 (The square as the intersection of closed half-spaces): We show 
two different representations of the square P = {(x,y) E R 2 | < x,y < 1} as 
the intersection of closed half-spaces (c.f. Figure [2^4)1 . 

(a) In the first example, the square is represented as the intersection of four 



10 Such a vector w exists: According to Proposition 12 . 30l there is a linear functional / 6 V* 
defining a hyperplane dH which contains F. Then, the vector w for which f (■) = (■ ,w) 
(which exists by the Ricsz Representation Theorem) is normal to F. 
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half-spaces: 

H4, = {(x,y) el 2 | x >0}, 
H 3 = {{x,y)eR 2 \x<l}, 
H 2 = {{x,y)eR 2 \y<l}, 
H Q = {(x,y)eR 2 \y>0}, 
7 = {4,3,2,0}, 

P = f| Hi . (2.32) 

In this case, Proposition 12.351 applies. For each facet of the square (i.e. for 
each edge of the square) , there is an i G / such that the boundary dHi of Hi 
contains the face. For example, the bottom facet of the square is contained 
in dHo. 

(b) In this example, the square is given by the intersection of the following 
infinite family of half-spaces: 



Hi-- 


= {(x,y)£R 2 \x>0}, 




H 3 -- 


= {(x,y)£R 2 \x<l}, 




H 2 - 


= {(x,y)eR 2 \y<l}, 




H 1/n = \{x,y)eR 2 \y>- 


1 



for every n <G N . 



/' = 4 4,3,2,1, -,-,-, 

' ' ' ' '234 



p=n*< 



iei' 

In this case, Proposition ^. 35l does not apply. Indeed, there is no i G I' such 
that the bottom facet of the square is contained in dHi since ^ /'. 

Another important property of polytopes concerning their representation as 
the intersection of half-spaces is shown in the following theorem. 

Theorem 2.37 ( Grvi67, Chapter 3.1]): Each polytope P is the intersection 
of a finite family of closed half-spaces (c.f. Theorem 12. 28|) . The smallest such 
family consists of those closed half-spaces containing P whose boundaries are 
the affine hulls of the facets of P. 

Note that Theorem 12.371 does not imply Proposition 12.351 (with the word 
"face" replaced by "facet"): Theorem 12.371 only states the existence of a family 
of half-spaces such that every facet is contained in one of the boundaries of the 
half-spaces, but it does not say that for a given intersection, there must be a 
half-space with this property. 
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(a) 




dH 2 



dH 



(b) 



8Ha 



dH* 




Figure 2.4: Example (a) shows the square as the intersection of four half-spaces. 
In this case, every facet of the square is contained in the boundary of one of 
the four half-spaces. In Example (b), however, there is no half-space whose 
boundary contains the bottom facet of the square. I 

Let us get back to some intuitive properties of polytopes. We would expect 
that if P is a d-polytope, then for every integer < k < d, P has a k-i&ce. This 
is indeed the case. More than that, the number of fc-faces can be lower bounded 
by a positive number. 



Theorem 2.38 ( Gru67, Chapter 3.1]): Let P be a d-polytope, and for every 
integer < k < d, let fk (P) be the number of k- faces of P. Then 



h(P) > 



In particular, for every < k < d, P has a fc-face. 

With the aid of Theorem 12.371 and Proposition I2.35[ we can prove another 
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very intuitive and useful property of polytopcs. 

Proposition 2.39: Let P be a polytope and let F be a proper face of P. Then 
there exists a sequence 

P = F D F±D ...D F k =F (2.33) 

of faces of P such that Fj.fi is a facet of Fi for every i 6 {0, . . . , k — 1}. 

Proof. By induction, it is sufficient to show that F is a proper face of a facet 
of P. Let {_ffi}i S / be the finite family of half-spaces whose boundaries are the 
affine hulls of the facets of P. By Theorem 12.371 P = f] ieI Hi- By virtue of 
Proposition 12.351 there is an I s / such that F C dHi. Proposition 12 . 101 implies 
that F\ = dHi n P is a facet of P. We know from Proposition 12 . 301 that there is 
a half-space H containing P such that dHP\P = F. It follows that F\ C H and 
FiHdH — F, so F is a face of Pi. This proves the existence of a sequence (|2.33p 
of subsets such that Pj+i is a facet of F^ for every j e {l,...,fc — 1}. By 
Proposition 12.81 those subsets are all faces of P. □ 



Before we dedicate ourselves to a few more properties of polytopes, this is a 
good point to introduce another example class of polytopes. 

Example 2.40 (Pyramids): A d-pyramid is a d-polytopc P such that there 
is some (d — l)-polytope B, called the base of the pyramid, such that P = 
conv(P U {a}) for some point a G P, which we call the apex of the pyramid 
(c.f. Figure[ 





Figure 2.5: From left to right, we see a 0-, 1-, 2- and two 3-pyramids. 

For d — 0, 1,2, there is only one type of d- pyramid simply because there 
is only one type of (—1)-, 0- and 1-polytope (which serves as a base for the 
d-pyramid). For d > 3, there are infinitely many different types of d-pyramids. 
If the base B is a square and the apex a is positioned centrally above B, we 
have the usual standard three-dimensional pyramid where this type of polytope 
gets its name from. ■ 

Now that we have just introduced pyramids, it is worth proving a lemma 
which will be useful in Section[5l To prove it, we make use of the following fact. 

Lemma 2.41 f |Grii671 Chapter 3.1]): If P is a ci-polytope, then each (d— 2) -face 
P of P is contained in precisely two facets Pi and F 2 of P, and F = Fi D F 2 ■ 
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This allows us to prove a fact about facets of pyramids that we naturally 
expect. 

Lemma 2.42: Let P be a pyramid with base B and apex a, let F be a facet 
of B. Then conv(F U {a}) is a facet of P. 

Proof. F is a (d — 2)-face of P. By Lemma r2.411 there are precisely two facets of 
P containing F, one of which is B. Let the other facet of P containing F be de- 
noted by G. We see from Proposition 12 .291 that G contains more extreme points 
of P than F. These additional extreme points have to be affinely independent 
of F since a,ff(F)P\P — F by Proposition ^. 101 Consider one of these additional 
extreme points and let it be denoted by p. Then aff(G) = aff (F U {p}) since G 
is a (d— l)-polytope and F is a (d — 2)-polytope. If p £ B, then we have 

aff (G) = aff (F U {p}) = aff (B) , 

which (by Proposition 12. 10[) implies 

aff (G) (IP = aff (B) C\P = B 

and therefore G = B. This contradicts our assumption that G ^ B, sop cannot 
be contained in B. The only extreme point of P which is not contained in B is 
a, so p — a and therefore G = conv(F U {a}). □ 
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3 Generalized probabilistic theories 

In this section, we introduce a framework of generalized probabilistic theories 
which generalizes classical theory and quantum theory to a more general set- 
ting. It is important to note that there is no standard framework for generalized 
probabilistic theories which is used overall. Instead, a few different frameworks 
have been considered. They mostly differ in the strength of their physical as- 
sumptions and the degree of the mathematical generality. The mathematical 
structure we are using in this thesis has been called the abstract state space, 
see for example [BBLW08J . |BGW09j . }BW09aj . |BW09bj . This section involves 
many mathematical definitions, but we will clarify their physical relevance by 
making examples concerning quantum theory, classical theory and some other 
special cases of generalized probabilistic theories. 

We start with Section T3 . 1 1 where we give an introduction to cones and ordered 
vector spaces, which form the mathematical structure of abstract state spaces. 
In Section 13.21 we provide a derivation of the abstract state space formalism 
from physical assumptions. To our knowledge, such a derivation of the abstract 
state space formalism has not been published so far, so it is worth introducing 
the ideas behind the framework in this thesis. We will discuss a one-to-one 
correspondence between compact sets of states and abstract state spaces. Basi- 
cally, abstract state spaces are the extension of the set of normalized states to 
the subnormalized states. At this point of our discussion, the state normaliza- 
tion is only a mathematical issue. Section 13.31 is dedicated to the definition of 
measurements on abstract state spaces and the investigation of their structure 
and properties. In Section [3.4[ we will give a physical interpretation of the state 
normalization. Section 13.51 is an attempt to give pure states a distinct physical 
interpretation. Finally, we will discuss transformations on abstract state spaces 
in Section [ 



3.1 Cones and ordered vector spaces 

We start this section with some intuition about cones in a physical theory. 
Suppose that the set of normalized states of a physical theory is given by a 
convex subset Qa of a vector space A (we will make this assumption in Section 
13. 2(1 . In quantum theory, for example, this is the set S(%) of density operators 
on a Hilbert space H, and the normalization is given by the trace of the operator. 
As we will see in Sections 13.41 and 13.61 it is often useful not only to deal with 
normalized states but also with subnormalized states. In quantum theory, this 
means that it is useful to consider positive operators p with tr(p) e [0, 1] instead 
of density operators (with tr(p) = I) only. This extends the set of states under 
consideration from Ha to the set fi^ = {auj 6 A | a € [0, 1], u G Ha}- Figure 
13. II shows this geometrically. 

It is mathematically very convenient not only to consider rescalings with 
scalars between and 1 but to consider a whole cone. In this picture, a cone 
corresponds to rescalings of Ha with all positive scalars. As we will see below, 
this allows us to make use of mathematical structures like orders, dual orders, 



32 



order intervals and more. 

We start our definitions with the definition of a cone. Before we do this, it 
is convenient to introduce the following notation. 




Figure 3.1: This figure gives a geometric picture of the set of subnormalized 
states. Here, the set of normalized states SIa is assumed to be a square (gray). 
The set of subnormalized states ilj is given by all rescalings of VI a with scalars 
between and 1. 



Notation: Throughout this thesis, we will use the following abbreviations. For 
any two subsets M and N of a real vector space and for any scalar a 6 1, wc 
denote 

M + N :={ra + n|m€ M, n £ N} , 

olM := {am \ m £ M} . 

Definition 3.1: Let V be a real vector space. A nonempty subset K of V is 
called a cone in V if the following conditions are satisfied] 11 ] 

• K + K = K, (3.1) 

• aK = K Va > 0, (3.2) 

• K n (-K) = {0}. (3.3) 

The conical hull of a subset M of V is given by 






conc(M) = < 2, a i x i 



e{0,l,2,...},x» 6M,«,>0 



It is easy to verify that for any subset M of a vector space V, the set 
cone(M) is a cone in V. Cones have an intuitive geometric picture. Figure [3~T1 
shows examples of cones. Clearly, cones are convex sets. We will see below that 
there is a one-to-one correspondence between vector spaces with a cone and 
ordered vector spaces. The latter is given by the following two definitions. 



11 In the literature, a cone is sometimes denned by property l|3.2j l alone. In this case, a cone 
satisfying I l3.il! is called a convex cone, and a cone satisfying 1 13. 31 1 is called a salient cone. 
We follow the definition in IAT07] which coincides with our Definition 13.11 
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Figure 3.2: A few examples of cones. For the sake of illustration, the cones have 
been truncated. They are actually infinitely high. The round cone on the very 
right is sometimes called the ice-cream cone (for the obvious reason). 

Definition 3.2: A partial order is a binary relation "< " over a set P which 
is 



• 


reflexive: 


a < a Va G P , 






(3.4) 


• 


antisymmetric: 


x < y and y < x 


=> 


x = y, 


(3.5) 


• 


transitive: 


x < y and y < z 


=> 


X < Z . 


(3.6) 



Sometimes, we will write y > x for x < y. 

Definition 3.3: A partially ordered vector space or ordered vector space 

is a real vector space V and a partial order "< " over V such that the following 
properties are satisfied: 



x < y 
x < y 



ax < ay 

x + z < y + z 



Vx,yeV, Va > , (3.7) 

Vx, yi z£V. (3.8) 



This allows us to state a proposition which gives us a geometric picture for 
ordered vector spaces. 

Proposition 3.4: Cones and ordered vector spaces obey the following corre- 
spondence: 

(a) If if is a cone in V, then x <k y '-^ y — x (z K defines a partial order on 
V which turns V into an ordered vector space. 

(b) If (V, <) is an ordered vector space, then V+ := {v 6 V | v > 0} defines a 
cone in V. 



Proof, (a) We have to check the reflexivity, antisymmetry and transitivity of 
<k as well as the properties (|3.7JI and (|3.8|) . Reflexivity is given if x <k x, 
which means that x — x — G K . This is true since the zero vector is 
always an element of a cone by property (|3.2[) or (|3.3[) . For antisymmetry, 
we need that x — y e K and y — x — — (x — y) £ K implies x = y. This is 
true by property ()3.3|) of a cone. Transitivity holds because property ([3.1 [I 
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says that y — x G A and z — y 6 A implies z — x = (z — y) + (y — x) G A. 
Property (|3.7[) follows directly from (|3.2[) and (J3.8I) follows from y — x G 
K =*> (y + z)-(x + z)€ K. 

(b) We have to check (|3.ip - (I3.3[) . (|3.1|) is immediately seen by noting that by 
(|3.8I) . we have that a; > and y > imply a; + y > and therefore x + y G A\ 
In a similar way, p. 21) follows from p. 71) . For (|3.3p . suppose that a; G A" 
and a; G —A, i.e. a; > and —x > 0. Then by (|3.7[) . we have that 2x > 0. 
Now apply (|3.8[) to get 2a; + (— x) > + (— x), i.e. x > —x. Analogously, 
we get —x > x. By the antisymmetry of the order, we get x ~ —x and 
therefore x — 0. We have inferred x = from x G A and a; G — A which 
shows An (-A) = {0}. 

□ 

This correspondence allows us to define the cone order and the positive cone. 

Definition 3.5: Let (V, <) be an ordered vector space. The positive cone V+ 
of V is given by 

V + := {« € V | v > 0} . 

Conversely, let V be real vector space and let K be a cone in V. The cone 
order "<k" on V induced by K is given by 

x <k y :•** y — x € K . 

If it is clear from the context by which cone the order is induced, the subscript 
A is dropped and we write "<" instead of "<k"- Moreover, we write x < y for 
(x < y and x ^ y), x > y for y < x and x > y for (x > y and x =/= y). 



Proposition 13.41 means that specifying an ordered vector space is equivalent 
to specifying a cone of a vector space, and we can refer to these two notions 
interchangeably. 

There are many easily constructible examples of cones in R™, as suggested 
in Figure 13.11 Instead of explicitly writing down such a cone, we make a more 
abstract example. 

Example 3.6 (The cone of positive operators on a Hilbert space): Let H be 
a finite-dimensional Hilbert space. The Hermitian operators on 1-L form a real 
vector space Herm(H). Consider the subset of positive operators on H. In 
comparison to the notation above, we have 

V = Herm(H) for some finite-dimensional Hilbert space T-L , 

V+ = {T G Herm(H) | (v\T\v) > \/\v) G H} . (3.9) 



It is easily verified that the positive operators fulfill the requirements (j3.ll) 
(|3.3| . i.e. that the positive operators indeed form a cone in T-L. The name of 
positive operators already suggests that this cone is induced by an order on 
Herm('H). The order is the usual operator order, given by 

T <U :<£> (v\U-T\v)>0 V\v)eH. (3.10) 
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This example illustrates the close connection between ordered vector spaces and 
vector spaces with a cone. The conditions (|3.9[) and p,10[) are very similar. It 
would be quite artificial to keep these two structures apart. I 

The next definition that we make will be particularly important in the con- 
text of measurements on abstract state spaces in Section 13.31 



Definition 3.7: For two elements x,y £ V of a partially ordered set V (in 
particular of an ordered vector space V ), the set 



i v 
is called the order interval from x to y 



{z <E V \ x < z < y} if x < y 
ifx^y 




n 





Figure 3.3: An order interval can be visualized as the intersection of an up- 
ward and a downward cone. For the sake of illustration, the cones have been 
truncated. 

Another important concept in the context of measurements on abstract state 
spaces is the fact that a cone (an order) in a vector space V induces a dual cone 
(a dual order) in the dual space V* of V. Before we define the dual cone (order), 
it is convenient to define what a positive linear functional on an ordered vector 
space is. 

Definition 3.8: A linear functional f G V* on an ordered vector space V is 
positive if f(v) > Vi> 6 V+. The functional f is called strictly positive if 

f{v) >0 for allveV+\{0}. 

Definition 3.9: Let K be a cone in a real vector space V . The dual cone of 

K, denoted by K* , is given by 

K* := {f EV* \ f{x) >0VxeK}. 

Equivalently, the dual cone is the set of all positive linear functionals, where K 
is regarded as the positive cone V+. By Definition \3.5l K* induces an order 
<k* on V* , which is called the dual order. 
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Example 3.10 (The dual cone of the positive operators and POVM elements): 
At this point, it is very instructive to reconsider the positive operators from 
Example 13.61 A very natural way to look at the dual space of Herm(H) is via 
the Hilbert-Schmidt inner product 

(T,17>Hs:=tr(Ttl7)=tr(TJ7). 

By the Riesz Representation Theorem, every vector T in Herm('H) (i.e. every 
Hermitian operator T on T-L) induces a linear functional on Hcrm('H), 

f T : Herm(-H) -> K 

U i-> tr(TU) 

This representation of linear functionals on Herm(H) by elements of Herm('H) 
naturally identifies Herm('W) with its dual space (Herm('H))*. It does even more: 
It turns out that the linear functional fr is positive in the sense of Definition ^. 81 
if and only if T is a positive operator in Herm(H), i.e. T £ V+ = {T £ Herm('H) 
(i>|T|i;) > y\v) £ H}. This means that the Hilbert Schmidt inner product, 
via the Riesz Representation Theorem, identifies the cone of positive operators 
V+ C Hcrm('H) with its dual cone VI C (Herm('H))*. By the equivalence of 
cones and orders, this also identifies the dual order with the order. This allows 
us to regard a positive operator as that what it is (a positive operator) as well as 
the linear functional associated with it. This expresses the fact that the cone of 
positive operators is strongly self- dual 12 ]. To make this more clear, we consider 
the set of POVM elements on "H. As we will see in Section I3~3l in the context of 
measurements, it is natural to consider order intervals of the dual cone. Here, 
the dual cone is (Herm('H))*. If we denote the identity operator on H by /, 
then, as we will see below, the relevant order interval in quantum theory is the 
oder interval [0,//] in (Herm(W))*. We have that 

[0, //] = {/ £ (Herm(H)r I < f(T) < /r(T) VT £ Herm(H)} . 

If we make use of the Hilbert Schmidt inner product and the Riesz Represen- 
tation Theorem again, we can identify the order interval [0,//] in (Hcrm(7^))* 
with the order interval [0,/] in Herm(H). It reads as follows: 

[0, 1] = {T £ Hcrm(H) | < T < 1} . 

The order interval [0, 1] is exactly the set of POVM elements on the Hilbert 
space T~L. This way of making the functional behavior of the maps fy implicit 
by treating them as positive operators is very common and convenient. In fact, 
without this identification, we would have to treat POVMs as functionals rather 
than as operators. ■ 



12 A positive cone V+ is said to be strongly self-dual if there is an invertible linear map 
4> : VT —} V4. which is symmetric and positive, i.e. f(<j>(e)) = e(<t>(f)) for all 6, / £ VT and 
e(</>(e)) > for all e £ V* JGB Blll . We will not go into more detail concerning self-duality 
of cones. 
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The next concept we are going to investigate is the notion of a base of a 
cone. 

Definition 3.11: Let K be a cone in a vector space. A nonempty convex subset 
B C A'\{0} is said to be a base 13 ] for the cone K if for each x 6 K\{0} there 
exists A > and b € B both uniquely determined such that x = Xb. 

It is important to notice that not every cone has a basis. For example, 

V+ = {(x,y) e R 2 | y > 0} U {(x,y) e M 2 | y = 0,x > 0} 

is a cone in R 2 but it has no base. The following theorem gives a precise 
characterization of the cones which allow for a base. 



Theorem 3.12 ( [AT07L Theorem 1.47]): A positive cone V+ of an ordered 
vector space V has a base if and only if V admits a strictly positive linear 
functional. More precisely, a subset B of V+ is a base of V+ if and only if there 
is a strictly positive linear functional / e V* and a a > such that 

B = {v eV + \ f(v) — a} (see Figure 




Figure 3.4: A visualization of Theorem 13. 121 



Example 3.13 (Density operators as a base for the cone of positive operators): 
Once again, we consider the vector space V — Herm(H) of Hermitian operators 



13 This notion of a base has to be clearly distinguished from the base of a pyramid as 
introduced in Examplc l2.40l In both cases, it is very common to call it a base, so we do not 
want to alter the terminology here. To make a distinction between the two, we denote the 
base of a pyramid by a normal B and the base of a cone by a calligraphic B. 
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on a finite-dimensional Hilbert space with the cone of positive operators (c.f. 
Examples l3.6l and l3.10p . The linear functional /j : T h-> tr(T) is strictly positive, 
as one can easily see: Every positive operator has an eigenbasis and has only 
non-negative eigenvalues, so its trace is non-negative and vanishes if and only if 
it is the zero operator. According to Theorem 13. 12[ this means that 

S(H) = {p£ Herm(H) | p positive, tr(p) = 1} , 

which is nothing but the set of density operators on H, is a base for the cone of 
positive operators. I 

In a finite-dimensional vector space, it is sometimes useful to make use of 
very basic topological properties of cone bases. Recall from Theorem 12. 131 that 
in a finite-dimensional vector space, we do not have to specify a topology since 
in this case, there is a canonical topology compatible with the vector space 
structure. In Section I3T21 we will make use of the following fact. 



Theorem 3.14 ([Bar02, Chapter II. 8]): Let if be a cone in a finite-dimensional 
real vector space V which has a compact base. Then K is closed. 

Closed cones in finite-dimensional vector spaces are particularly neat because 
they show a certain duality property concerning their dual cone. The next 
proposition makes this statement more precise. 

Proposition 3.15: Let V be a finite-dimensional real topological vector space, 
let K he a closed cone in V. Then, the double dual cone (K*)* in (V*)* = V is 
identical to K. 

Proof. Recall that V can be canonically identified with (V*)* via (v*)*(f) = 
f(v). Taking this into account, we have 

K* = {/ G V* | f{v) >0VveK} 

(K*)* = {v e V I f{v) >0V/e K*} 

from which one can see that K C (K*)* . Thus, we have to show that (K*)* C 
K. Let V be equipped with any norm (which necessarily induces the topology 
on V since V is finite-dimensional). Let w G V, w £ K. K \s closed and convex, 
so by virtue of the Hahn-Banach Theorem, there is a g G V* with 

g(w) < M{g(v) \v&K}-.k. (3.11) 

We have that the zero vector is contained in K, so k < since g(0) — 0. We 
cannot have k < 0: If k < 0, there is a v G K such that g(v) < 0. Multiplying 
v with a large enough scalar a > 0, we would have that g{av) < g(w) while 
av G K, which contradicts Inequality p. lip . Therefore, k = which implies 
that g G K* . We have shown that w ^ K implies the existence of a linear 
functional g with g(w) < 0, so w $l (K*)* . This means that (K*)* C K which 
completes the proof. □ 
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Sometimes, it makes an important difference whether the vector space which 
contains a cone is chosen "too big" . For example, one might consider the non- 
negative y-axis as a cone in R 2 . Another example is an ice-cream cone (c.f. 
Figure 13. 1[) as a cone in R 4 . In both cases, the linear span of the cone is a 
proper subspace of the vector space. In some contexts, one wants to exclude 
this case by requiring that the cone is generating. A cone is generating if its 
linear span coincides with the vector space containing the cone. Noting that for 
a cone A, we have that span(A') = A — A = {x — y | x, y € A}, this gives the 
following definition. 

Definition 3.16: A cone K in a vector space V is called generating if K—K = 

V. 

The next notion we want to explain is the notion of an order unit. 

Definition 3.17: Let V be an ordered vector space with positive cone V+. A 
vector e € V+ is called an order unit in V if for each v £ V there exists some 
A > such that v < Xe. For the dual space, if f € V* is an order unit in V* 
(with respect to the dual order), we might also say that f is an order unit on 

V. 

In the context of abstract state spaces, where one considers closed and gen- 
erating cones in finite-dimensional vector spaces (c.f. Section l3.2|) . the terms 
"strictly positive linear functional" and "order unit" are used synonymously. 
The following Theorem explains why one can do so. 

Theorem 3.18 ( AT07, Theorem 3.5]): For a closed and generating cone V+ 
in a finite-dimensional vector space V and for some / £ y*, we have that / is 
strictly positive if and only if / is an order unit in V* , i.e. an order unit on V. 

This concludes our mathematical introduction to cones and ordered vector 
spaces. 

3.2 The abstract state space 

In this subsection, we develop a particular kind of framework of generalized 
probabilistic theories which has also been called the abstract state space formal- 
ism [BBLW08J . [BGW09] , |BW09aj . |BW09bj . The introduction to this frame- 
work given here is not found in other references dealing with abstract state 
spaces. It reflects the view of the author of the present thesis and should not be 
regarded as a standard introduction. To our knowledge, such an introduction 
to abstract state spaces has not been published so far. 

The framework relies on the following four central notions: probability, sys- 
tem, state and measurement. These notions will not be further specified here. 
Their meaning is assumed to be given. However, it is conceptionally important 
to notice that these notions do not have an independent meaning but only make 
sense in the context of each other. We will infer the framework from a number of 
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assumptions. Those assumptions are part of the framework of an abstract state 
space. To distinguish these framework-based assumptions from the assumptions 
that we will make in Sections HI and [51 we call them "Assumptions", whereas we 
will refer to the assumptions in Sections 0] and [3] as "Postulates" . 

We start the derivation of an abstract state space by the specification of the 
set of normalized states. As we have said above, we do not specify here what a 
state is but we assume that the meaning of this notion is given. However, we 
will explain what the term "normalized" stands for. We will explain at the end 
of the section why one can call them normalized from a mathematical point of 
view. In Section 13.41 we will explain the physical interpretation of the state- 
normalization. For now, we might think of the set of normalized states as the 
set of those states of a system which are not conditioned on any event, whereas 
we will interpret the subnormalized states as states which are conditioned on a 
preceding random process. 

The first assumption that we make is that the set of normalized states is a 
convex set. We have already motivated this assumption in Section |2~T1 There 
we said that we want a set of states to be convex because we want to treat all 
probabilistic mixtures of states in a consistent way. In Section 12.21 we saw that 
from a very general and abstract point of view, this leads to the notion of a 
convex space. We mentioned that roughly speaking, convex spaces split into 
probabilistic and possibilistic spaces. Our concern are probabilistic theories, 
i.e. theories where the set of normalized states is a convex subset of a real 
vector space. According to the Theorem 12.21 by Stone, this is equivalent to 
assuming that the set of normalized states is a cancellative convex space. This 
is our first assumption. In the literature, the cancellation property is usually 
not mentioned, but the set of states is assumed to be embedded in a vector 
space without further comments. We state the assumption of the cancellation 
property explicitly. 

Assumption 1: For any system A, the set of normalized states is a can- 
cellative convex space. In other words, the set of normalized states is a convex 
subset of a real vector space. 

The next assumption is very common not only in the framework of abstract 
state spaces but also in most (if not all) frameworks of generalized probabilistic 
theories that have been considered so far. It is of a purely technical nature, 
used to make the mathematics involved feasible. There is no immediate physical 
reason to make this assumption. 

Assumption 2: The real vector space containing the convex subset of normal- 
ized states is finite- dimensional. 

With these two assumptions, we have a finite-dimensional real vector space 
at hand. Recall from Theorem 12.131 that in this case, we have a canonical 
topology on the vector space. This allows us to refer to topological properties 
of the space without explicitly specifying a topology. The next assumption that 
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we make is of topological nature and reads as follows. 
Assumption 3: The set of normalized states is compact. 

From a mathematical point of view, this assumption facilitates a few tech- 
nical issues. It also has the interesting physical consequence that, by virtue 
of Theorem 13.141 and Proposition 13.151 it establishes some kind of duality be- 
tween states and measurements. We will come back to this issue in Section 
13.31 However, the physical motivation for this assumption is not completely 
undisputable. By Assumption [51 we are in the finite-dimensional case, where 
the question of compactness divides into the questions of closedness and bound- 
edness. We cannot give completely clear reasons for these two assumptions, but 
at least we want to say something about the closedness. Assume that a physical 
system can be prepared in a way such that certain statistics of measurements 
can be approximated arbitrarily well. In other words, assume that the state 
of a system can be prepared in states which are arbitrarily close to a certain 
"state" . If one takes up the position that in this case, the approximated "state" 
should indeed be regarded as a state as well, then the assumption of closedness 
becomes natural. A more formal and mathematical argument would relate this 
to the fact that in a Hausdorff space, the set of all points which a series in a 
set can converge to is given by the closure of the set. We do not want to be 
dogmatic about this assumption. We make the assumption of compactness for 
technical reasons. 

In the following, we motivate the abstract state space structure from As- 
sumptions []] [2] and [3] To have an idea where this is going, we define what we 
mean by an abstract state space. 

Definition 3.19: An abstract state space is a tuple (A,A + ,ua), where A is 
a finite- dimensional real topological vector space, A + is a closed and generating 
cone in A and ua is an order unit in A* (i.e. an order unit on A). 

Here, in order to keep the introduction to the abstract state space formalism 
concise, we only give a rough picture of how a set of normalized states gives rise 
to an abstract state space at this point. For a precise mathematical proof of the 
equivalence of these two structures, we refer to Appendix [Bj The proof is quite 
lengthy and is not necessary for an intuition for abstract state spaces, so we do 
not include it in this section here. 

We start with a set of normalized states which satisfies Assumptions [I] [2] and 
[3J Assumption [T] states that the set of normalized states is a cancellative convex 
space. In Section |2"T21 we have seen that by Theorem 12.21 (Stone), this means 
that we can assume that our set of normalized states is a convex subset of a 
real vector space. By Assumption [21 this vector space is finite-dimensional. By 
Theorem 12.131 we have a canonical topology on this vector space. Assumption 
[3] states that in this topology, the set of normalized states is compact. We want 
to see that the set of normalized states can be seen as the base of a closed cone. 

To see this, visualize a convex set Qa in a vector space A. Say that the afhnc 
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hull aff(O^) of this set has dimension d. (For the sake of illustration, imagine 
that Ha is a square, i.e. d — 2.) Assume that the vector space A which contains 
£Ia has dimension d + 1. (In the case where Qa is a square, this means that A 
is three-dimensional.) Imagine that VLa is placed "somewhere above the origin" 
of the vector space A (in particular, Q a does not contain the origin). In this 
case, the set Qa generates a cone: The set A + — {auj e A \ a > 0,uj e Ha} is 
a cone in A (see figure I3.5[) . 



supcrnormalized "s,t'atcs" 




Figure 3.5: This figure visualizes how a compact convex set of normalized states 
forms the basis of a cone. The rescalings of Ha with factors between and 1 
form the set of subnormalized states H^ . The rescalings with factors larger 
than 1 (the supernormalized "states") are not physical, but they are elements 
of the cone A + . 

The cone A + is generating since we have assumed that the vector space 
is of only one dimension higher than the set of normalized states. The set of 
normalized states Ha is a base of the cone A + . This cone is closed by virtue of 
Theorem I3H1] since Ha is compact. By Theorem l3.121 there is a strictly positive 
linear functional, which we call ua, such that Ha = {w G A + | ua{u) = 1} (see 
Figure I3.6[) . In Theorem 13.181 we have seen that we can equivalently say that 
ua is an order unit in A* , or an order unit on A. 

This gives us an abstract state space (A, A + ,ua)- We have only given a 
rough picture here. We have not proved our claims. Moreover, we have not said 
whether the abstract state space (A 7 A + ,ua) constructed from the set Ha is 
unique. Conversely, we have not answered the question whether every abstract 
state space in turn gives rise to a compact convex set of normalized states 
Ha- Roughly speaking, it turns out that both questions can be answered in 
the affirmative. There is a one-to-one correspondence between compact convex 
subsets of finite-dimensional vector spaces and abstract state spaces. For more 
details, we refer to Appendix [Bl 

In the following, when we talk about generalized probabilistic theories, we 
will always work in the abstract state space formalism. We make the following 
definitions. 
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Figure 3.6: This figure visualizes the role of the order unit ua in an abstract 
state space. 



Definition 3.20: For an abstract state space (A, A + ,ua), we define the set of 
normalized states by 

H A := {oj e A+\ u a {oj) = 1}. 

In analogy to quantum theory, the extreme points of Ha are called pure states. 
We define the set of subnormalized states H^ by 

f^ 1 :={u(=A + \u a (uj) <!}. 

The interpretation of H^ will become clear in Section 13.41 For an abstract 
state space (A,A + ,ua), we will often refer to the abstract state space merely 
by A instead of the whole tuple (A, A + , ua)- 

Example 3.21 (Quantum theory): We already know all structures that describe 
quantum theory as an abstract state space from the Examplcs l2.5[l3~Bll3.10l and 
[331 

• A — Herm(%) for some finite-dimensional Hilbert space H , 

• A + = {Te Hcrm('H) | T > 0}, the set of positive operators on H , 

• u A = // : T m. tr(T) , 

• Ha = S(TL) , the set of density operators on H. . ■ 

So far, we have used the term of a set of normalized states without explaining 
the reason for that. Now, we argue mathematically about this notion. Given 
the positive cone A + of an abstract state space A, the set Ha is completely 
characterized by the order unit ua on A. It is not a priori clear why a linear 
functional should give rise to a set which is called normalized, since one would 
expect that this is associated with a norm. It turns out, in fact, that given 
A + C A and ua (and therefore Ha), there is a canonical choice of a norm on 
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A. We can get this norm by denning a norm || • ||a* on A* induced by CIa and 
then take the dual norm | • \\a on A. The norm || • ||a* is defined by 

||/|U»= sup |/(w)|. (3.12) 

Proposition 3.22: If A+ C A is a generating cone, ua is an order unit on A 

and Qa — {uj ^ A+ \ ua{w) = 1}, then ||/||.a* = sup |/(^)| defines a norm on 

A*. 

Proof. We have to verify that || • ||a* satisfies (a) positive homogeneity, (b) the 
triangle inequality and (c) definiteness, which are all very easy to check. 

(a) : ||a/|U* = sup |a/(w)| = |a| sup |/(w)| = |a| \\J\\a* 

u£Q,a uj^Oa 

(b) : ll/ + .9lU*= sup \f(u)+g(w)\< sup \f(uj)\ + \g(uj)\ 

uj^VLa lu££1a 

< sup |/H|+ sup | ff (w')| = H/IU* + ||5lU* 

lu^Qa lj' £Q,a 

(c) : I l/l I = => Cl A C ker(/) =► span(Q A ) C ker(/) 

=> ker(/) = A (since span(ri A ) = A) =» / = □ 

We will make use of the norm 1 1 • 1 1 ^« in the proof of Proposition l3.40l below. 

Definition 3.23: For a normed vector space V with norm \\ ■ \\v , the dual 

norm \\ ■ \\y on V* is given by 

||/||v. :=sup{\f(v)\\v&V : \\v\\ v <l}. (3.13) 

It is not difficult to check that this gives indeed a norm. Substituting V = 
A* in this definition, we obtain a norm | • \\a on A. Comparing (|3.13|) with 
CIa = {cj e A + J ua{oj) — 1}, we see that 



||w||a = «a(w) Vu; G A 



This explains, from a mathematical point of view, how it is justified to call CI a 
the set of normalized states. We will see in Section 13.41 how the states in CI a 
are interpreted as opposed to subnormalized states in Cl^ . 

3.3 Measurements on abstract state spaces 

We have seen in Section l3~2"1 that the specification of a compact convex set of nor- 
malized states gives rise to the structure of an abstract state space. This struc- 
ture basically adds the state normalization to the framework. In this section, 
we will see that the abstract state space in turn gives rise to the mathematical 
structure of measurements. 

Suppose that a measurement AA is performed on a system. This measure- 
ment has some finite set of possible outcomes Im = {1, . . . , n}. In the abstract 
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state space formalism, M is represented by a set of functions M. = {ei, . . . , e„}, 
where ek is associated with the measurement outcome k £ Xyvi ■ These functions 
are to be interpreted as follows. Suppose the system, prior to the measurement, 
is in the initial state ui. If the measurement is performed on the system, then 
efe(w) is the probability that the measurement outcome is k. To allow a physical 
interpretation, these functions have to satisfy four properties. In the following, 
we discuss these four consistency properties. 

The first property links to the idea that we have explained in Section 12.11 
Suppose that the system undergoes a random process 7?.. This random process 
has the possible outcomes 1 and 2 which have the probabilities p\ and p%. If 
the outcome is 1, the state of the system after the random process is tui, in the 
other case it is u>2- If we do not know the outcome of the random process, we 
have argued in Section 12.11 that we would describe the state after the random 
process by the probabilistic mixture uj — pxOJi +p2^2- Assume that after the 
random process, we measure the system with respect to M. = {ei, . . . , e„}. The 
probability for the measurement outcome k is given by efe(w) = ek{piui\ +P2W2). 
Recapitulate this situation. We do not know the outcome of the random process 
1Z which brings us to take the probabilistic mixture of the state. Then we apply 
efe to get the probability for the measurement outcome k. From a physical point 
of view, however, there is no reason why we should not calculate the probability 
for outcome A; by first calculating it given that we know the outcome of 1Z and 
then take the probabilistic mixture of the probabilities efc(wi) and e/^o^)- In 
other words, it is not physically determined "where to mix" . Therefore, we 
expect that 

ekiPi^i +P2U2) =Piek(ui) +p 2 e k (uj). (3.14) 

In Section [2Tj we have seen that in the case of quantum theory, the linearity of 
the expression 



Pi 



= ^2 vkPi\k = ^2 pk tr (QiPk) 



(in other words the linearity of the function [p/. M> tr(Qipk)}) allows us to 
regard probabilistic mixtures of states to be equivalent to probabilistic mixtures 
of probabilities. But in a generalized probabilistic theory, this is not a priori 
given. This means that in order to treat probabilistic mixtures in a consistent 
way, we have to assume that Equation (|3.14j) holds, i.e. that the functions ek 
are convex-linear. 

The physically reasonable domain of the functions ek is the set of subnormal- 
ized states fl^ of an abstract state space A. (For the mathematical convenience, 
we will treat them as functions ek '■ A — > R below.) The second property that 
we require from the functions e/. is that they map the zero vector 6 A to zero. 
The reason for that is that (as we will see in Section I3~4l we interpret the state 
G A as the state of a system which is conditioned on an impossible event. The 
joint probability of an impossible event and some other event must necessarily 
vanish. The first two properties that we have discussed imply that the functions 
ek are linear functionals (as we will see below). 
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If we want to interpret the values efc(u>) as probabilities, then we have to 
require that 

< efc(w) < 1 Vw g ft^ 1 . 

Equivalently, we can require that (given the e& are linear) 

0<e fc (w)<l Vwefii. 

This is the third property that we demand from the functions e^. Finally, it is 
very natural to assume that if we perform a measurement, it is certain that we 
get some outcome. This leads to the fourth consistency property: 

y ekiui) = 1 Vw G Qa ■ 

These four consistency requirements are necessary for a physical interpreta- 
tion of the functions {ek}kei M - However, it is not clear why these requirements 
should be sufficient in the sense that any set of functions {ek}kei M which sat- 
isfies the four conditions should correspond to a physical measurement. In the 
abstract state space formalism, it is assumed that any set of functions {ek\k^i M 
satisfying the above mathematical requirements corresponds to a physical mea- 
surement. It should be pointed out that this causes a loss of generality of the 
framework. One might think of physical theories where not all mathematically 
defined measurements are possible, and such theories are not encompassed by 
the abstract state space formalism. 

We state our assumption explicitly. 

Assumption 4: Any finite set M — {ek}kex M of functions ek : ^ — > K for 
which 

• each ek is convex-linear on J7^ , (3.15) 

• e fe (0)=0 \fkel M , (3.16) 

• 0<e fc (w)<l VweO^Vfeelx, (3.17) 

• Y^ e <tH = l Vw e n A (3.18) 

corresponds to a physical measurement. For an initial state uj G £Ia prior to the 
measurement, the value efc(w) is the probability that a measurement with respect 
to M. = {ek}kei M gives the outcome k G Im- 

Proposition 3.24: Let A be an abstract state space, let M = {ek}k^i M be a 
set of functions ek : £1^ ~~* ^- Then M = {ek}kex M satisfies properties (|3.15p . 
(|3.16|) and (|3.17[) if and only if every function ek extends to a linear functional 
e-k G A* which lies in the order interval between the zero functional and ua in 
the dual cone, i.e. ek G [0,1^] G A* . Property p. 181) is satisfied if and only if 

Efcex.M ek = UA ~ 
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Proof. We know from Theorem l2.19l that every convex-linear function is a linear 
function plus a translation. This implies that every convex-linear function which 
leaves the origin invariant is linear. Therefore, (I3.15[) and (|3.16[) imply that 
the ek extend to linear functionals, so we can say ek G A* . The inequality 
< efc(w) < 1 for all u> G ft a can be rewritten as < efc(w) < Ua(w) for 
all u) G ft a- The set ft a is a basis of A+, so {aui \ a > 0,ui G ft a] = A + . 
Thus, by the linearity of the functionals efe, the condition (|3.17|l extends to the 
whole cone, i.e. < efc(w) < iiyi(w) for all w G A + . This means that (|3.1Tp 
implies that the efe lie in [0,ua] := {/ € A* | < / < ua} C A* in the dual 
order. Conversely, it is readily verified that every element of [0, ua] satisfies the 
properties (|3.15p , (|3.16|l and (|3.17[) . By an analogous argumentation, (|3.18[l is 
equivalent to J2k£T M e k — U A- d 

Functions that satisfy the properties (j3.15[) . (|3. 16|) and (|3.17|l are commonly 
called effects. The above leads us to the following definition. 

Definition 3.25: For an abstract state space A, we define the set of effects by 
the order interval Ea '■= [0, ua] = {/ G A* < / < ua} in the dual order. An 
element e G Ea is called an effect. A measurement is a set M. — {ei, . . . , e n } 
of effects such that Y^k=i e fc = U A- 

It is easy to see that the set of effects Ea is a convex set. It is the order 
interval from to ua and therefore it is the intersection of the positive (upward) 
cone A* + — {/ G A* | / > 0} in A* and the downward cone {/ G A* \ f < ua} 
in A* (c.f. Figure l3~3"l) . Thus, Ea is the intersection of two convex sets and 
therefore convex. 

There is something interesting to notice at this point. The cone A*,, which 
we could roughly call the "effect cone" , is the dual cone of the "state cone" 
A + . The state cone A+ is a closed cone in a finite-dimensional space. By 
Proposition l3.15[ the double dual cone (A* + )* , i.e. the dual cone of the dual cone, 
is identical to the cone A + . This establishes some kind of duality between states 
and measurements. Instead of specifying the triple (A, A + ,ua), one could just 
as well define an abstract state space by the triple (A* ,A* + , ua). In other words, 
one could specify an abstract state space by the definition of the measurements 
instead of by the definition of the states. This might be an attractive idea for 
people who take up the position that measurements are "more operational" than 
states. 

Below, we will often focus on measurements that consist of effects which are 
extreme points of Ea- It is convenient to give these effects a special name. 

Definition 3.26: An extreme point e of the set of effects Ea is called a pure 
effect. A measurement A4 = {ei, . . . ,e n } is called a pure measurement if all 
effects e\, . . . , e„ of the measurement are pure. 

Example 3.27 (POVMs in quantum theory): We know from Example 13.101 
that the dual cone A* + of the cone A + of positive operators can be identified 
with A+. We have also seen that [0,//] ~ [0,/] is the set of POVM elements. 



From the above definition, we get that a measurement in quantum theory is 
given by a set {-P;}™ =1 of positive operators such that Y17=i ^ = I> as expected. 
There is something important to notice. The set of POVMs contains (orthog- 
onal^ projectors. As we will see in Section 14.11 the projectors are precisely 
the pure effects in quantum theory. However, not every POVM element is a 
projector. Every positive operator P with eigenvalues smaller or equal to one 
is a POVM element, and whenever one of the eigenvalues a satisfies < a < 1, 
P is not a projector. We will discuss in Section 14.11 that there are important 
differences between projectors and other positive operators in the discussion of 
post-measurement states. 



Example 3.28 (The polygon models IJGBBTT]): The polygon models form a 
whole class of abstract state spaces. They have been studied in the context 
of nonlocality, but they also provide interesting examples in our context. In 
Section 14.21 we will consider polygon models as counterexamples to properties 
of abstract state spaces that we assume to be physical. 

For every n <E {3, 4, 5, . . .}, there is a polygon model, which is defined by 



A = R/\ 

A + = cone(£7^), where 

O^ = conv({wi, . . . , uj n }) C 1R 3 , where 

( r n cos (2f) \ 



r„, sin 



'2ixi^ 



eir 



(3.19) 



/ 



V cos ^ 

This induces the following set of effects: 

14 We arc restricted to the set of Hermitian operators, in which an operator P is a projector 
if and only if it is an orthogonal projector. 



4') 



If n is even, 

• E"a = conv{0,ei, . . . ,e n ,UA}, where 



U A = 





V 1 J 



sin (fi^s) 



V 



g (IR 3 



/ 



If n is odd, 
• E% = conv{0, ei, . . . ,e n ,UA - &\, ■ ■ ■ ,u A - e n ,u A }, where 

I \ / r„co S (^i) \ 



MA 





V 1 / 



i + ^ 2 



V 



r„sin(^i) 
1 



R' 5 



G (R 3 )* S B 3 . (3.20) 





n = 6 



Figure 3.7: The polygon models for n = 3 to n = 8. The surface (polygon) 
at the top of each model represents the set of normalized states iY\, while the 
crystal-like shape below the polygons represents the set of effects E\. 

The fact that the polygon models are defined in M 3 makes them particularly 
neat because they can be visualized (c.f. Figure [3~7l) . It is easy to see that each 
set of effects Ea is the intersection of an upward and a downward cone (c.f. 
Figure [3~5)l . The polygon model corresponding to n — 3 is precisely a classical 
system with three pure states. We will examine classical systems in more detail 
in Example 13.291 below. The n = 4 polygon model corresponds to a so-called 
gbit (this stands for "generalized bit"). A gbit represents the local state space 
of a frequently discussed bipartite model which is called the PR-box or nonlocal 
box [PRM] . * ■ 
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Example 3.29 (Classical theory): This example is very central, both in prob- 
abilistic theories in general and in this thesis. We say that an abstract state 
space A is a classical theory if the set of normalized states FIa is a simplex. 
The reason why such an abstract state space is called classical is that this al- 
lows us to interpret a state as a classical probability distribution. To see this, 
recall from Proposition 12.341 that each point in a simplex is a unique convex 
combination of its extreme points: 

Qa simplex, ext(P) = {wi, . . . , w n }, w G CIa 

n n 

=>■ 3 unique (pi, . . . ,p n ) such that u = N»i w ij Pi > 0; z_.P l = •"■ ■ 

i=l i=l 

If we interpret u>i , . . . , ui n as mutually exclusive properties of the physical sys- 
tem, then we can interpret (p±, . . . ,p n ) as a probability distribution over these 
properties. The characterizing properties of a classical system are the fact that 
a state represents a unique probability distribution over its pure states and 
that these pure states can be perfectly distinguished, as we will discuss in the 
following. ■ 

In operational terms, we say that some states u)\, . . . , u) n are perfectly distin- 
guishable if we can perform a measurement whose result allows us to determine 
in which of the states wi , . . . , w n the system was prior to the measurement (given 
that it was in one of these states) . In a rigorous form, perfect distinguishability 
reads as follows. 

Definition 3.30: Let A be an abstract state space. We say that states u>i, . . . ,u> n £ 
ft A are perfectly distinguishable if there is a measurement {ei, . . . , e„} C Ea 
on A such that 

ei(ujj) = Sij Vi,j e {l,...,n}. 

The following proposition can also be found in |MDVlll Lemma 24]. 

Proposition 3.31: Let A be a d-dimensional abstract state space. The follow- 
ing statements are equivalent. 

(a) There are d perfectly distinguishable states wi, . . . , Ud £ ^A- 

(b) £Ia is a (d — l)-simplex with ext(17 J 4) = Wii ■ • • ,<^d, i-e. A is a classical 
theory. 

Proof. We prove the two implications separately. 

• (a) =>■ (b): If the states uj\ 1 . . . ,ujd G ft a are perfectly distinguishable, 
then there are effects /i, • • • ,/d G Ea such that fi{oJj) — % for all i,j G 
{1, . . . , d}. If the states wi, . . . , ujd were linearly dependent in A, i.e. if 
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there was an i G {1 , . . . , d} such that 



uii = 2. a kOJk with a.k G K , (3-21) 



fe=i 



then 



/i( w # y^Qfc/i(^fc) 



fc=i 



would lead to a contradiction to (|3.21[) , so the states ui\ , . . . , ujd are linearly 
independent vectors in A. Thus, they form a basis for A. For every 
u! G Qa-, we have that 



\_] h^k for unique numbers fteK, (3.22) 



d 

(jj = 

k=l 

d 

u A (u) = J20k = l, (3.23) 

fc=i 
d 
< A(«) - ]T &/<(«*) = ft . (3.24) 

fc=i 

Equations (|3.22|l . (|3.23|l and (|3.24|l imply that any state ui G ^a is a 
unique convex combination of the d states u>i,...,u)d- This implies that 
CIa C conv({wi, . . . ,Ud}) =>• &a = conv({wi, . . . ,w n }) =4> ext(f2^) C 
{«i, . . . , u^}. We have that aff (f^) = aff (ext^^)) is (d— l)-dimensional, 
so ext(Oji) contains at least d elements. Therefore, ext(O^) = {wi, ...,Wd}. 
We have proved that every element of fi^ is a unique convex combination 
of its d extreme points. By Proposition 12.341 this implies that Qa is a 
(d — l)-simplex with ext(f7^) = {wi, . . . , cod}- 

• (b) =>- (a): If Qa is a (rf — l)-simplex, then ext(n^) = {wi,...,w<j} is 
an affinely independent set of vectors such that conv(ext(fi,4)) = &A- 
Suppose that wi, . . . ,Wd are linearly dependent in A, i.e. 

d 

> ctiCUi = for some real numbers a* . 
i=i 

Then we would have that G aff(nx) because 



»=i ^=i< 
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would be an affine combination of elements in Qa- But the zero- vector 
cannot be an element of aff(fl > i) because aff(fi^) = {u? E A ua{u) = 1} 
but ua(0) — 0. Thus, wi, . . . , u>d are linearly independent vectors in the 
d-dimensional vector space A. This means that they form a basis. Let 
/i,...,/d G A* be the dual basis with respect to w\, . . . , Udj i-e. 

fi{w j )=5 ij . (3.25) 

The only thing we are left to show is that f\, . . . , fa E Ea = {f E A* 
< /(w) < 1 Vu; G Qa}- This follows from Qa — conv(wi, . . . ,u>d) and 
(1X231) : 



w g ri^ 




,i 



Ji.-U i - /a ( > ; 3,^ ) = £ # / fc ( Wi ) G [0, 1] Vfc G {1, . . . , d} 

1=1 ^r 



convex sum 



=► fi,...,fd€E A . □ 

Example 3.32 (Polytopic theories): This is the class of theories for which we 
will derive the results in Part [TT] of this thesis. We say that an abstract state 
space A is a polytopic theory if the set H,a is a polytope. We have already 
seen examples of polytopic theories: The polygon models (Example l3.28p are all 
polytopic theories since a polygon is a polytope. Every classical theory (Example 
13.29)) has a simplex as the set of normalized states, so it as a polytopic theory as 
well. Besides these two classes, one might think of any other polytope serving as 
the set of normalized states, e.g. a cube, a pyramid or any higher-dimensional 
polytope. ■ 

Now that we know how measurements are defined, we want to investigate 
some of their properties. At first, we have a closer look at at some properties 
of pure effects, before we study a few properties of the set of states Ea in the 
case where the set of normalized states £Ia is a polytope. 

Given a pure effect / G Ea , there is always a "complementary" effect / such 
that {/, /} is a pure measurement, as shown by the following proposition. 

Proposition 3.33: Let A be an abstract state space. If / G Ea is pure, then 
/ := ua- f € E A is pure. 

Proof. Let uj G SI a- 

(tt-/)(«) = l-/(w)G[0,l] => u-JeEa. 



r>:-! 



Let g,h e Ea such that Xg + (1 — X)h = u — f. 

=^f = u-Xg-(l-X)h 

= Xu + (1 - A)u - A.g - (1 - X)h 
= X(u-g) + (l-X){u-h) 

6B.4 e£.4 

The effect / is pure, sou — g = u — h = f and therefore g = h = u — f. □ 

We will refer a few times to this kind of complementary effect, so it is prac- 
tical to give it this name. 

Definition 3.34: For a pure effect f £ Ea, the effect f := ua — f is called the 
complementary effect to f . 

Another very central property of a pure effect / is that / is naturally asso- 
ciated with a face Ff of the set of normalized states. This is established in the 
following. 

Proposition 3.35: Let A be an abstract state space, let / ^ be a pure effect 
on A. Then there exists a state u G Jl^ such that f(w) = I. 

Proof. The effect / is nonzero and £Ia is not contained in a linear hyperplang 15 ! 
of V, so there are states on which / is positive. The effect / is a continuous 
real- valued function on the compact set Ha, so there is a maximum of / on CIa- 
Let m :— max ffg (i A /(c). We have said that m > 0, and by the definition of an 
effect, we have that < m < 1. Thus, < t^— < 1. Note that —feE A and 
hf € Ea- We take the convex combination 

2 — m \m J \ 2 — mj \2 

But / is an extreme point of Ea , so— / = /^m = l. □ 

Proposition 3.36: If / G V* is a nonzero linear functional on a compact 
convex set C, then F? 1 := {x E C \ f(x) — m}, where m = max/(x), is a face 

ofC. 

Proof. The functional / is a continuous real-valued function on the compact set 
C and therefore has a maximum on C which we call m. Let v, w € C, < a < 1. 
Suppose that aw + (1 — a)ui € F™, i.e. 

a/(v)+(l-a)/(w) = m. (3.26) 

<m <m 



15 We say that a subset M of a finite-dimensional vector space V is a linear hyperplane if 
there is a nonzero linear functional g £ V* such that M = {v £ V | g(u) = 0}. 
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Equation Q3.26P is clearly satisfied if f(v) = f(w) = m. If f(v) < m (or 
f(w) < m), then, in order to satisfy equation (13.26[) , f(w) > m (or /(«) > m, 
respectively), which contradicts the fact that m is the maximum of / on C. 
Thus, f(v) — f(w) — 1, i.e. v,w £ FY 1 , which implies that FT 1 is a face of C 
(c.f. Definition HH). □ 

Analogously, Proposition ^ . 36l holds in the case where m is the minimal value 
of / on C, but we will not make use of this fact, so FT 1 is considered with respect 
to the maximal value m of / on C. 

Corollary 3.37: Combining Propositions 13 . 35l and 13 .351 one has that for every 
nonzero pure effect /, Ff := Fj is a nonempty face of fl. 

Definition 3.38: For a pure effect f £ Fa, the face Ff associated with f 

is defined by Ff := Fj = {u £ Ha | /(w) = !}• The opposite face is given by 

F f := F T = {uen A \ 7(w) = i} = {uj e n A I f(u) = 0}. 

Example 3.39 (Associated faces and opposite faces in the polygon models): 
We reconsider the polygon models which we have seen in Example 13.281 In the 
case where the polygon has an even number of vertices, the pure effects and the 
associated faces are of a different character than in the case where the number 
of vertices is odd. We consider the two cases separately. 

(a) n is even: In this case, we have the n pure effects ei, . . . , e„ G E\, the unit 
effect ua € Ea and the zero effect G E 7 ^, so alltogether, there are n + 2 
pure effects. Let E\ = E 7 a \{ua, 0} = {ei, . . . , e„}. For each effect eu € E\, 
the face associated with ek is a facet of the polygon, i.e. F Bk is an edge 
of the polygon. Moreover, the complementary effect e~k — ua — e^ always 
coincides with some other effect e/ € E\. Therefore, the face F ek = F-^ 
opposite to F ek is an edge as well. 

As an example, consider the the square, which is the polygon model cor- 
responding to n — 4 (see Figure [3~8]) . For the pure effect e% £ E A , the 
associated face F e3 is an edge. The effect ej complementary to ea is S3 = ei, 
and the face F ea opposite to F e3 is the edge F e3 = F ei . 

(b) n is odd: Here we also have pure effects e±, . . . , e n , ua, £ E\, but in this 
case, these are not all the pure effects. For each of the pure effects e^ £ 
{ei, . . . , e n }, the complementary effect e~k = ua — ^k (which, by Proposition 
I3.33l is a pure effect) does not coincide with some other effect in {ei, . . . , e n }. 
Alltogether, this makes a total of 2n+2 pure effects. The face F ek associated 
with some effect et £ {ei, . . . , e n } consists of only one state, namely w&. On 
the other hand, the opposite face F ek is an edge of the polygon. 

As an example, consider the regular pentagon, which is the polygon model 
corresponding to n = 5 (see Figure I3.9[) . The face F e3 associated with the 
pure effect e% £ E A consists of only the state UI3. The opposite face F es , 
however, is an edge of the pentagon. 



•W 




Figure 3.8: This figure shows the n = 4 polygon model (the square) with the 
face F e3 associated with the pure effect e-$ and the opposite face F e3 (which is 
associated with the complementary effect ej). 



Fe 3 = {^3} 





Figure 3.9: This figure shows the n = 5 polygon model (the regular pentagon) 
with the face F ea = {lo^} associated with the pure effect e% and the opposite 
face F e3 (which is associated with the complementary effect S3). ■ 

Now we want to see what properties Ea has in the case where Ha is a 
polytope. The first thing we prove is that in this case, Ea is a polytope as well. 

Proposition 3.40: Let A be an abstract state space such that Ha is a polytope. 
Then Ea is a polytope. In particular, Ea has only finitely many pure effects. 



Proof. Let A* be equipped with the norm (|3. 121) that we have introduced in 
Section EH 



ll/l 



sup |/(w)| 



Ea is obviously contained in the unit ball in A* with respect to this norm. This 
implies that Ea is bounded. By the definition of Ea, we have that 

E A = {/ e A* I < f(u) <lVwe n A }- 

We can rewrite this as 

E A = {/ e A* I /(w) > Vw 6 O-i} n {/ e A* | /(w) <lVwe n A } ■ 



0(i 



If the two inequalities /(u/j) > and /(wj) < 1 are satisfied for a family {uji}i 
of states in SI a, then they are also satisfied for all states in the convex hull 
conv({uJi}i) of the family. Ha is compact (by the definition of a polytope), so 
by Theorem l2.141 it is the convex hull of its extreme points. Therefore, 

E A = {/ e A* | f(u) >0VLoen A pure} n {/ G A* \ f(u) < 1 Vw G Q. A pure} . 

Qa is a polytope, so it has finitely many extreme points. Thus, the above 
equation implies that Ea is the intersection of a finite family of closed half- 
spaces and therefore a polyhedral set. By virtue of Theorem 12.281 Ea is a 
polytope. □ 

Above, we have established that for every pure effect / G Ea, there is a 
non-empty associated face Ff given by /(w) = 1 O w £ f / for all w G Ha- An 
interesting question is whether the converse is true: If F is a face of Ha, is there 
a pure effect / G Ea such that F = Ff? It turns out that this is not the case. 
A counterexample: If F is a vertex of the n — 4 polygon-model, then there are 
effects / such that F = {u G Ha I /( w ) = 1}, but none of these effects is pure. 
However, there is a weaker version of the statement which is true. 

Proposition 3.41: Let A be an abstract state space such that SIa is a polytope. 
If F C SI a is a facet of VLa, then there exists a pure effect g G Ea such that 
g(bj) = l«w£f, i.e. such that F = F g . 

Proof. Let 

XA = {w G A | maM = 1} , ^a = {^ G ^4+ | ua(w) = 1} . 

Recall from Proposition 13 . 151 that for a closed cone K, one has that (K*)* = K . 
The cone A+ is closed by the definition of an abstract state space. This implies 

A + = {ue A | f{u) >0V/G A* + }. (3.27) 

This allows us to write 

n A = {lu g A+ I ua(w) = 1} ^^ {^ G A | ua(w) = 1, /(w) > V/ G a;} 

= {^G X a|/H>0V/gA;}. (3.28) 

For every / G A^_, there is an a > such that a/ G -EaLj For any lo e xa, one 
has that f(u)) > if and only if af(u) > 0. This allows us to rewrite (I3.28[) : 

n A = {coe xa I /(w) >0V/eE A }. (3.29) 

Note that 

/ i— > ma — / is a bijection on Ea ■ (3.30) 



16 This can be seen as follows. A positive linear functional / £ A*, is a nonnega- 
tive continuous function on the compact set Q.A- Thus, it has a nonnegative maximum 
m := maxugjlj fi 1 ^)- If Wi = 0, then / S E^. If m > 0, then «/ S -Ea with a = — > 0. 
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Therefore, 

/(«)>0«*(u-/)(w)<l (3.31) 

=> n A = {uje X A\ f(u) > V/ e E A } 

= {loExa\ /(w) < 1 V/ £ E A } by (l3"5Ul) and (l3~5TD . 

Note that if /i(w) < 1 is satisfied for a family {/,}, of functionals, then it is 
satisfied for all functionals in the family's convex hull, f(ui) < 1 for all / g 
conv({fi}i). By virtue of Theorem 12. 14[ il A is the convex hull of its extreme 
points, so 

n A = {uj exA\ f(u) < 1 V/ € E A pure} . 

For / = and / = u^, we have that {uj G \A | /( w ) < 1} is all of XA> so 

Oi = W G XA | /(«) < 1 V/ e ^}, where (3.32) 

Si = {/ e ^ | / pure, / ± 0, / ^ Uj4 } . 

By Proposition 13.401 E A is finite. Therefore, we see from f|3.32[) that Vt A is the 
intersection of finitely many half-spaces, where the boundary of each half-space 
is the set of points in \A at which a pure effect has value 1. Lemma |2 .351 implies 
that there is a pure effect g such that g(uj) = 1 for all u G F. 

It remains to show that g(ui) < 1 if ui € Cl A ,ui ^ F. Let a G ^1^, <r <£ F. The 
point a is not an element of aff (F), but aff (F) is equal to {lj G xa \ g(u) = 1} 
since F is a facet (a maximal proper face). Thus, a cannot be an element of 
Q A , so we have a contradiction. □ 

3.4 The physical interpretation of the state normalization 

So far, the only subset of the abstract state space A we have talked about in a 
physical context is the cone-base Q A , which we also call the set of normalized 
states. In Section 13. 2\ we have argued mathematically why this notion makes 
sense. Now we will give a physical interpretation of the "norm" u A (ui) of a state 
uj G £l A . In the following, when we talk about states, we mean elements of the 
set of subnormalizcd states £1^ = {u G A + | u a (lj) < 1}, and when we say 
normalized states, we mean elements of Q A — {uj G A + \ u a (uj) = 1}. 

Suppose that a random process 1Z with the possible outcomes In — {1, . . . , n} 
takes place. Assume that the state of a system is prepared in a state u>k G £l A 
which depends on the outcome k G F-n . In Section 12.11 we argued that if we 
are ignorant about the outcome of the random process, our description of the 
system is given by the normalized state 

n 

ui = ^p k ui k , (3.33) 

fc=i 

where p k is the probability for the outcome fc G I-r,. This is the probabilistic 
mixture of the states {uJk\k£iK G fl A . If we know that the outcome of the 
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random process is m £ %r, i.e. conditioned on the outcome m, we describe the 
system by the normalized state ui m . These two situations (being ignorant about 
the outcome and knowing the outcome) thus lead to the states (13.331) and u> m , 
both of which are meaningful. 

Besides these two treatments of states (which both deal with normalized 
states), there is a third one which is meaningful in the presence of random 
processes (which deals with unnormalized states). Consider the following: 



• 



Suppose that a random process prepares the system in the state u>k with 
probability pk ■ 

Subsequent to the random process, a measurement M. on system A is 
performed. 



In Section [3.31 we have seen that measurement outcomes k £ Im are associated 
with linear functionals efc £ {ek}keX M - If w € £Ia is a normalized state (prior 
to the measurement) and I £ Im is a possible outcome of the measurement 
M, then ei{ui) gives the probability that the outcome of the measurement is 
I. In this context, however, it makes sense to consider a subnormalized state 
a = pk^Jk (instead of a normalized state uj). Notice that pk^k ^ ^A whenever 
Pk < 1. One can interpret this state as follows. 
For ei £ M, we have that 

eiipk^k) =Pk ■ ei{u k ) . 

The value p k is the probability that the outcome of the random process is k. 
The value eiiujk) is the probability that the subsequent measurement gives the 
outcome I, conditioned on the fact that the outcome of the random process was 
k. Thus, ei(pkU>) = ei(a) is the joint probability for the event that the outcome 
of the random process is k and the outcome of the subsequent measurement is I. 
Thus, subnormalized states give us descriptions of joint probabilities. Moreover, 
from the subnormalized state a, we can read out the probability pk separately 
by applying the order unit ua because ua{o~) = UA(j>kWk) = Pk- 

This gives an interpretation for states uj with < ua(oj) < 1. Therefore, the 
set of physically meaningful states is given by the set of subnormalized states 

nf = {ue A + |«a(w)<1}. 

3.5 Pure states and maximal knowledge 

Suppose that a physical system is in some state w £ Ha ■ Assume that we want 
to perform a purer 7 I measurement A4 = {ek}kei M on the system. In general, 
we cannot predict the outcome k £ Im of the measurement with certainty. 
Our prediction of the outcome is given by a probability distribution, and this 
distribution might have probabilities that are neither zero nor one. In a classical 



17 We will see in Section l4.1l how pure measurements are distinguished from other measure- 
ments. 
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theory (c.f. Example 13. 29|) . there is a natural way to interpret this situation. 
When the outcome of a measurement cannot be predicted with certainty, then 
classically this can be interpreted as the circumstance that we do not know 
enough about the state of the system to make a definite prediction|^£| For 
example, we might assume that the physical system is a die. Suppose we are 
sitting on a table. We drop the die on the floor and it rolls under the table 
where we cannot see it. How many pips does the die show? 

We describe the set of normalized states of this system by a 6-simplex. Each 
vertex Uk of the simplex, k = 1, . . . , 6, corresponds to a definite number of pips. 
As long as we do not have a look under the table to find out how many pips 
the die shows, we describe the state of the die by a mixed state. If the die is 
unbiased, we describe its state by the probabilistic mixture u) — Ylk=i | w fe since 
we have no idea which number the die shows. If the die is biased, e.g. with an 
additional weight on the face with one pip, then it might be more likely that the 
die shows one pip, uj = \u)\ + Y2k=2 ~ni u k- We perform a pure measurement on 
this system by having a look under the table to find out how many pips the die 
shows. In both cases (w and uj), we cannot predict the outcome with certainty. 
It is natural to say that we cannot predict the outcome because we do not know 
enough about the die under the table. If the die under the table were in a pure 
state, we could predict the outcome with certainty. We would have maximal 
knowledge about the system. 

For a quantum system, e.g. a spin-i particle, the situation is different. Even 
if the system is in a pure state, for example in the up-state | f z ) with respect 
to the z-axis, there are pure measurements for which we cannot predict the 
outcome with certainty. For instance, if we were to predict the outcome of a 
spin-measurement with respect to the x-axis, we would assign the probability 
i for both of the outcomes "up" and "down" . But in this case, unlike the case 
of a die, the fact that we cannot predict the outcome with certainty can not 
be interpreted as the fact that we do not know enough about the state of the 
system. In the case of a die, the state ui = \w\ + J2k=2 To w fc can De interpreted 
as "with probability ^ the system is in the pure state Wi, with probability 
Yq, the system is in the pure state W2" and so on. In this situation, the fact 
that we cannot predict the outcome with certainty can be interpreted as being 
due to the circumstance that we have incomplete knowledge about the state of 
the system, since we can represent the state as a probabilistic mixture of other 
states. In the quantum case, where the state is given by the pure state | t x ), this 
interpretation does not apply. The state of the system is pure, so it cannot be 
represented as a (nontrivial) probabilistic mixture of other states. We can say 
that we have maximal knowledge about the state of the system. The fact that 
we still cannot predict the outcome with certainty (although we have maximal 
knowledge the state) might therefore be interpreted as an inherent property of 



18 We do not want to be dogmatic about this Bayesian interpretation of probability. The 
goal of this section is to explain that pure states have a physical interpretation that distin- 
guishes them from mixed states. Such an interpretation is also possible from other viewpoints 
regarding probability. We, however, describe the physical distinction of pure states from mixed 
states in the Bayesian picture. 
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the theory, rather than being due to our ignorance about the state. 

This interpretation of pure states as states of maximal knowledge sheds new 
light on Theorem l2.14l This theorem by Minkowski states that a compact convex 
subset of a finite-dimensional vector space is the convex hull of its extreme 
points. In other words, a compact convex set in a finite-dimensional vector space 
is fully characterized by its extreme points. In Section ETSl we have assumed that 
the set of normalized states is a compact convex set in a finite-dimensional space, 
and the extreme points ext^^) are precisely the pure states. In summary, we 
might therefore say the following. 

Pure states are states of maximal knowledge, and the set of states is 
fully characterized by these states of maximal knowledge. 

In other words: 

Every state is a probabilistic mixture of states of maximal knowledge 
(in the case of pure states, this mixture is trivial). 

3.6 Transformations on abstract state spaces 

In this section, we investigate the concept of transformations on abstract state 
spaces. Roughly speaking, a transformation is a map from the set of subnor- 
malized states tt^ to itself which maps an initial state to a final state. 

The way transformations are treated here is different from how transforma- 
tions are normally defined. The reason is that in this thesis, we never consider 
multi-partite systems. We always consider systems of only one constituent. 
Thus, we do not specify how systems are combined to form bi-partite or multi- 
partite systems. This makes a difference in the definition of transformations. In 
the treatment of multi-partite systems, one has to require a consistency property 
of transformations which is called complete positivity. The definition of complete 
positivity depends on the specific way in which multiple systems are combined 
in a physical theory. As we do not specify a rule for how to combine systems, 
we cannot give a definition of complete positivity. Instead, we only require the 
weaker property of positivity (we will explain this property below). Nonetheless, 
this will not lead to problems. Since positivity is weaker than complete positiv- 
ity (no matter how systems are combined) , the class of positive transformations 
is larger than the class of completely positive transformations. Thus, any result 
that is inferred for positive transformations also holds for completely positive 
transformations. 

Transformations naturally arise in two contexts: dynamics and measure- 
ments. In this thesis, we do not deal with dynamics, so we forget about this 
aspect in the following and focus on measurements. When we describe measure- 
ments, there are two aspects involved. If we only want to describe one single 
measurement, we only need to care about one of the two aspects. This aspect 
is the probability distribution of the measurement outcomes, i.e. the measure- 
ment statistic. But if we want to describe consecutive measurements, we also 
need another aspect. We need to care about how a measurement influences the 
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measurement statistic of a subsequent measurement. If we perform two consec- 
utive measurements on a system, it might be the case that the statistic of the 
second measurement depends on the outcome of the first measurement. For a 
full description of how the statistic of the second measurement might be influ- 
enced, we need to specify the state of the system after the first measurement. 
This specification is made by a transformation which maps the initial state to 
the post-measurement state. 

The first of the two aspects that we have just described is fully covered by 
describing a measurement by a set of effects. To meet the second aspect, we 
need to treat a measurement as a set of transformations. Below, we will call 
such a set an operation. 

To discuss the second aspect of a measurement in more detail, suppose that 
we perform a measurement M. = {ei, . . . , e n } with outcomes Im — {1> • ■ • > n} 
on a system. Assume that subsequent to this measurement, we make some 
other measurement J\f with outcomes I/y. If we have a full description of this 
situation, then this means that we can infer the measurement statistic of the 
second measurement for all possible choices of AT. This in turn means that we 
need to have a description of the state after the first measurement, since a state 
is exactly the mathematical object that gives us the measurement statistic for 
every possible measurement Af. Two states of a system are different if and only 
if there is some measurement for which the two states induce different statis- 
tics. This means that a full description of consecutive measurements necessarily 
involves the description of post-measurement states. 

Above, we mentioned that the statistics of a second measurement can depend 
on the outcome of the first measurement. In other words, the post-measurement 
state can depend on the outcome of the measurement. For the moment, we 
describe the transition from the initial state (prior to the measurement) to the 
post-measurement state by a map tk '■ &>a ~^ ^A- This map tk depends on the 
outcome k € Im of the Admeasurement. It takes an initial state and maps it to 
the post-measurement state for the case that the outcome of the measurement 
is k. Thus, if we want to describe both aspects of a measurement AA - the 
outcome statistic of Ad and the influence on the statistic of any subsequent 
measurement M -- we might achieve this by a set of tuples {(ek,tk)}kex M ' 
where the first element of each tuple is an effect e^ and the second element is a 
map tk '■ &>A — > &A- 

So far, we have only talked about normalized initial and post-measurement 
states u 6 ft a- We have not made use of the fact that an abstract state space 
provides the structure to deal with the state normalization. If we use this extra 
structure, we can combine the effect et and the map tk ■ ^a — ► ^A to form 
a transformation T^ which encompasses both aspects of a measurement. We 
achieve this by defining T^{u) = eh(u>)tk{oj). This gives a map T^ : £1^ — > O^ . 
From this transformation X^, we can infer both the measurement statistic and 
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the post-measurement state: 

e k (u) = (u A of k )(uj) 
T fc («) 



t k (u) = 



(u A oTfc)(w) 



To allow concatenations of transformations, we want to extend the map T k '■ 
Qa —> &a to a ma P ^a ~^ ^A ■ For reasons °f mathematical convenience, 
we extend it to a map T& : ft a —> ^a- For a physical interpretation of the 
transformation T k , we have to require four properties of T k . 

The first two properties are the convex-linearity of the transformation and 
that it leaves the origin invariant. These two properties are completely anal- 
ogously to the first two properties that we demanded for effects in Section 
13.31 so we only recall shortly the reasons for these conditions. The convex- 
linearity expresses the fact that there is no physical specification of whether we 
should take probabilistic mixtures of states or of probabilities. Thus, we regard 
them as identical and reach the requirement of the convex-linearity. We require 
Tk(0) = because the zero-state is the state conditioned on an impossible event. 
These two properties together imply (as in the case of effects in Section l3"Uf the 
linearity of the transformation. 

The third and the fourth property arise from the requirement that if we 
restrict T k to the set of subnormalized states $7^ , then we should get a map 
that maps to the subnormalized states £7^ . This requirement splits up into the 
third and fourth property. The third property is the positivity of T/.. 

Definition 3.42: A map (f> : A —> B between ordered vector spaces A and B is 
positive if 4>{A + ) C B + . 

The fourth property is that T k does not increase the norm of the state, i.e. 
ua(T/c(u;)) < 1 for all w G O^ , or equivalently v,A(Tk{u)) < 1 for all u> £ Qa- 
These four properties lead us to the definition of a transformation. To make 
this definition in more generality, we define a transformation as a map between 
possibly different abstract state spaces A and B. 

Definition 3.43: A transformation T : A — > B between abstract state spaces 
A and B is a map which fulfills the following conditions: 

• T is linear. (3.34) 

• T is positive. (3.35) 

• T does not increase the norm, i.e. ub(T(uj)) < 1 for all u> G Qa- (3.36) 

Given that T is linear, conditions US. 35]) and \3.36)) can be summarized as 
T(Q,a) C T(il^ ). The effect induced by the transformation is given by 
ct = ub o T . 

We have described how transformations arise in the context of a measure- 
ment. More generally, we can think of transformations as arising from any sort 
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of random process. Suppose there is a random process, and in the course of the 
random process, a transformation takes place which depends on the outcome of 
the random process. We call this an operation. 

Definition 3.44: An operation O = {Ti, . . . , T k } between abstract state spaces 
A and B is a family of transformations T k : A — > B such that the effects induced 
by the transformations sum up to the unit effect: 

n n 

J2(uB°T k )(u;)=J2 eT ^ = 1 Vwe ^- ( 3 - 37 ) 

fe=i fe=i 

It has the interpretation that if w is the initial state of the system prior to the 
operation, then with probability eT k (<-o) = (uB°Tk)(uj), the operation transforms 
the state according to T k : 

T k (ui) 



(u B oT k )(u) 



An operation is a stronger formulation than a measurement, in the sense 
that every operation induces a measurement. 

Proposition 3.45: Every operation induces a measurement. 

Proof. At first, we check that for every transformation T k G 0, the effect er k 
induced by T k is indeed an effect. The function er k a linear functional by (|3.34p . 
We have that < f(u) < 1 for all uj 6 tt A by (|33B1) and (pHo) . Finally, the 
effects sum up to one because of Equation (|3.37j) . D 

For now, we close the discussion of transformations at this point. We will 
come back to the issue of transformations in Section T5.1[ where we will finish this 
part of the framework. We will make the assumption that the post-measurement 
states of pure measurements are given by transformations which induce the 
measurement. The reason why we restrict to pure measurements will become 
clear in Section |4"7T1 
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Part II 

The application of the techniques 
and the results 

In Part U we have learned about a particular framework for generalized prob- 
abilistic theories called the abstract state space formalism. We have seen that 
probabilistic mixtures give rise to convex sets of states in Section [2. II In Section 
[31 we have seen how convex sets fit into the abstract state space formalism. Sec- 
tion [2j3| was dedicated to the study of some properties of convex sets in general, 
before we investigated polytopes as a special type of convex sets in Section 12.41 
Polytopcs will be of particular importance in Part [ill When we introduce some 
physical principles and state them as "Postulates" , we do this in the full gener- 
ality of the abstract state space formalism. But the results that we derive from 
these postulates are based on the restriction to polytopic theories (i.e. theories 
where the set Ha of normalized states is a polytope). In other words, we derive 
the results for theories with only finitely many pure states. 

Admittedly, the restriction to polytopic sets of states is artificial and has 
no physical justification. While classical theory (with a simplex as the set of 
states) is a polytopic theory, quantum theory is not. A quantum system with 
a two-dimensional Hilbert space, for example, has a set of normalized states 
which is convex-isomorphic to a closed unit ball (the Bloch sphere), so it has 
continuously many pure states. Hence it is worth saying a few words about how 
quantum theory is related to the results of Part HH 

The idea of the postulates is to 

(a) identify physical principles that are satisfied by quantum theory and 

(b) determine generalized probabilistic theories which violate these postulates. 

Referring to what we have said in the introduction of this thesis, this can be 
regarded as a step towards the higher goal of inferring quantum theory from 
physical principles. This goal would be achieved if we would find physical prin- 
ciples that (a) are satisfied by quantum theory but which (b') rule out all prob- 
abilistic theories except for quantum theory. As (b') seems to be difficult to 
achieve, it might already be a step forward to identify physical principles that 
only rule out some class of probabilistic theories (but not all except for quantum 
theory). This is the concern of the present thesis. In this sense, the restriction 
to polytopes is justified, since we manage to rule out a class of probabilistic 
theories by postulating physical principles that are satisfied by quantum theory. 
We want to point out that the depth of the insight into quantum theory that 
we gain by (a) and (b) significantly depends on two conditions on the principles: 

1. The principles should, as much as possible, be of a physical nature rather 
than of a mathematical nature. The more this physical aspect has an 
operational interpretation (rather than being a hardly accessible, very 
abstract idea), the less mysterious is our picture of quantum theory. 
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2. Within a certain minimal strength of deduction, the principles should 
(appear to) be as weak as possible. Strong physical principles should be 
inferred from a few weak principles rather than being assumed from the 
beginning. 

We approach the idea that we have just described in two different ways by 
considering two different approaches to postulate physical principles. 

We discuss the first approach in Section|U There we consider three principles 
(or postulates). Postulate Q] is called repeatability. It demands that if we perform 
a pure measurement twice in a row, then we get the same outcome both times. 
This requirement constrains the set of possible post-measurement states to a 
certain subset of ft a- Postulate [2] states that the set of all states satisfying 
the repeatability condition have a certain subspace structure, so we call it the 
subspace principle. Postulate[3jis what we call the state discrimination principle. 
Suppose that two sets Ai and A 2 of states can be perfectly distinguished by 
a measurement. Assume that in addition, two subsets A3,A4 C A2 can be 
perfectly distinguished from each other. Postulate [3] claims that in this case, 
the sets Ai,A3 and A4 can be perfectly distinguished. We will see that every 
polytopic theory satisfying Postulates [TJ [2] and [3] is a classical theory, i.e. a 
theory where the set of normalized states is a simplex. 

The second approach is the main result of this thesis and is presented in 
Section[S] It achieves the same (inferring classical theory from polytopic theory) 
by only postulating one simple and plausible physical principle. This principle 
states that if we know the result of a measurement in advance with certainty, 
then we can perform this measurement without disturbing the statistics of any 
other measurement. In other words, a measurement that does not provide any 
information does not disturb the state. We will argue that such a measurement 
can be seen as the readout of classical information. 
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4 Repeatability, subspaces and a state discrim- 
ination principle 

In this section, we consider three physical principles, which we will state as 
"Postulates" , and study their consequences. Although from a technical point of 
view, it is not necessary to introduce post-measurement states, the interpreta- 
tion of two of the postulates relies on post-measurement states to some extend. 
This forces us to be careful. In Section 14.11 we will discuss that Definition 13.251 
of a measurement encompasses a class of measurements which is too general for 
a consistent treatment of post-measurement states. We explain why it is nec- 
essary to restrict to pure measurements when we talk about post-measurement 
states. 

In Section 14.21 we introduce the principle of repeatability and the subspace 
principle. Section [4.3l is dedicated to the state discrimination principle. Finally, 
we will show in Section 14.41 that a polytopic theory which satisfies these three 
principles is precisely a classical theory, i.e. a theory where the set of states is 
a simplex. 

4.1 Post-measurement states and pure measurements 

In the following sections, we will be concerned with post-measurement states. 
Whatever we assume about post-measurement states in generalized probabilistic 
theories, when we apply it to quantum theory, it should not contradict the known 
laws of quantum theory. It is not our goal to disprove quantum theory but to 
understand what makes it special. To respect this, we have to be careful that 
we do not make statements about a too large class of measurements. In the 
following, we state this more precisely. 

In Section 13. 6( we have explained that in order to describe consecutive 
measurements in a probabilistic theory, we need a rule for assigning post- 
measurement states. This is a rule which, given an initial state ui and an effect 
e, gives the post-measurement state w pos t for the case where we perform a mea- 
surement on a system in the state u> and obtain the outcome associated with 
the effect e. This is an assignment (w,e) <-> w post (w,e). We call this an update 
rule for short. 

In quantum theory, the effects are given by POVM elements. We have an 
update rule for the case where the POVM element P is a projector, i.e. P 2 = P. 
This update rule is called the von Neumann- Liiders projection. It makes the 
assignment 

PpP 
(p,P)^ /0post (p,P) = _^. (4.1) 

But as we have mentioned in Example l3.271 projectors are not the only effects in 
quantum theory. One might ask whether the von Neumann-Luders projection 
can be generalized. POVMs are not intended to make statements about post- 
measurement states but only about the statistics of measurement outcomes in a 
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single-shot measurement. However, we might forget about the actual purpose of 
POVMs for a moment and ask whether the the von Ncumann-Luders projection 
can be extended to arbitrary POVM elements. It turns out that this cannot be 
achieved in a consistent way. 

To see this, we consider two different situations which lead to the same 
POVMs but to different post-measurement states. The following example achieves 
this by two different global projective measurements on a larger system. 

Example 4.1 (Projective measurements on a larger system): Let Ha — C 2 be 
the Hilbcrt space of a system A, let T = {i<i , F2 , F3 } be the POVM on Ha 
given by 








F 2 = 




F,= 




This POVM can be seen as being induced by a projective measurement on a 
larger system which contains A as a subsystem. To see this, let Hb — C 2 be 
the Hilbcrt space of an ancilla system B and let 



Pb = 



1 




eS(H B ) 



be the state of system B. In the Kronecker product matrix representation, let 
Q = {G\, G2, G3, G4} be the projective POVM on Ha <S> Hb given by 
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It is easily calculated that for any state pa € S(Ha), we have that 

tr(F iPA ) = tr(Gip A ® Pb) Vt £ {1, 2, 3}, tr(G 4 pA O Pb) = . 
In the same way, instead of Q, we could have chosen the projective POVM 
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Q = {Gi, G 2 , G 3 , G 4 } on Ha <g> H B given by 
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This would induce J 7 as well: 
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tx{Fip A ) = trCGip^ <8> Pb) Vi G {1, 2, 3}, tr{G 4 p A <8> p B ) = 

for any state p^t G S^-La)- 

If we calculate the post-measurement state p^ s e associated with Q and 
trace out the ancilla system B to get the post-measurement p' A g of system A, 

wc get a different state than if we do the same for Q, p' . = ^ p\g. For example, 



tr B 



tr B 



/ Gip A ® PbGx 
to{GipA<® Pb) 

Gipa® pbGi 



tr(Gip A ® Pb) 




(independent of pa), but 
(independent of pa as well) 



This shows that from the POVM T alone, there cannot be a consistent update 
rule. ■ 



Example 14.11 shows that the update rule for generalized probabilistic theories 
should only make statements about post- measurement states for a class of effects 
which, in the case of quantum theory, reduces to orthogonal projectors. In 
generalized probabilistic theories, we no longer have the notion of projectors as 
effects, since effects are elements of a more abstract convex set Ea- Therefore, 
we have to hnd a criterion formulated in the language of convex sets. It turns 
out that projectors are precisely the pure effects of quantum theory. 

Proposition 4.2: Let (A, A + ,ua) be a finite-dimensional quantum theory, i.e. 
let 

• H be an n-dimensional Hilbert space, let 

• S(TL) be the set of density operators, on H, let 

• A = Hcrm('H) be the set of all Hcrmitian operators on H, let 
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• A + be the cone of all positive operators on Ti, let 

• / be the identity operator on H, let 

• E A = [0, 1] = {T(= Hcrm(-H) | < T < 1} C A+. 

Then T is a pure effect (i.e. an extreme point of Ea) if and only if T is a 
projector £3 

Proof. At first, we prove that any clement of [0,/] can be written as a convex 
combination of (orthogonal) projectors. Write T in its eigendecomposition: 

n n 

T = ^A, |ei)(ei|=2AiPi, 

i=l i=l 

where Ai > A 2 > . . . A„. The operator T can be written as 

T = A1P1 + X 2 P 2 + ■■■ + KPn 

= (Ai - A 2 )Pi + (A 2 - A 3 )Pi + . . . + (A„_! - \ n )P l + A„Pi 

+ (A 2 - A 3 )P 2 + . . . + (A„_! - A„)P 2 + A n P 2 

+ ... 

+ (A„-i — A n )P„_i + A n P„_i 

+ A n P„ 
= (Ai - A 2 )Pi + (A 2 - A 3 )(Pi + P 2 ) + . . . 

+ (A„_i - A„)(Pi + . . . + P n _i) + A n (Pi + . . . + P n ) 



= J^ piUi , Pi = Xi - Ai+i > (An+i - 0) , n, = Y^ Pk ■ 

i=l k=l 

Define po = 1 — Ai > 0, so that 

n n 

T = p + ^PjIT and }^Pi = 1 ■ 

i=l i=0 

This proves that every POVM-element is a convex combination of projectors. 
Next prove that a projector cannot be written as a proper convex combination 
of elements of [0, 1]. Let II be a projector and suppose that 

n = AP+(l-A)P, 0<A<1, E,F£[0,I]. 

For any \ip) 6 ker(II), we have 

= (V>|n|V>) = A (i>\E\il>) +(1 - A) {ip\F\ip) > 

>0 >0 

=*► (ip\E\ip) = (ip\F\ip) =0. (4.2) 



19 When we say projector, we always include the zero operator and the identity operator 
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Let C be an operator satisfying C*C = E, and let D satisfy D*D = F. Then 
Q4.2J1 implies 




(C-0|C^) = ||C^|| 2 = =► C\i>) = => E\ip} = V|^) G ker(II) . 
Likewise, we have that 

F\i/>) = V|V>) e ker(n) . 

supp(F),supp(£') C supp(IT) . 

+(l-\)(cf>\F\<t>)<l 



D 

Proposition 14.21 provides us with the criterion that allows us to distinguish 
between projectors and other effects in quantum theory in the language of ab- 
stract state spaces: projectors are pure effects. By what we have discussed 
above, this means that we make the following restriction: 

Whenever we make statements about post-measurement states or 
consecutive measurements, we restrict ourselves to pure effects. 

4.2 Repeatability and subspaces 

The first principle that we want to postulate is the principle of repeatability. 
This is a statement about consecutive measurements and therefore about post- 
measurement states. By the discussion of Section l4~Tj this means that we restrict 
the statement of the principle to the case of pure effects. 

Postulate 1 (Repeatability): If we perform a pure measurement twice in a 
row, then we get the same outcome both times. In other words, if we perform a 
pure measurement M. — \e\, . . . , e„} C Ea and get an outcome associated with 
e,; G A4, then the post-measurement state uj post satisfies ei(u pos t) = 1. 



We can link this postulate to a result that we have inferred in Section 13.31 
Corollary 13.371 states that for a pure effect e G Ea , the set of all states ui G £Ia 
satisfying e(o>) = f is a face of £Ia- We called it the face associated with e. In the 
context of Postulate[T] we will also call it the face of possible post-measurement 
states. 

From a purely technical point of view, we will not explicitly make use of 
this postulate. Instead, it will be implicitly contained in Postulate [2] In other 
words, Postulate [T] will be the motivation for Postulate^ In order to make clear 
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what physical ideas are behind our assumptions, we state Postulate [T] explicitly, 
although this is technically not necessary. 

It is very natural to postulate repeatability. From the von-Neumann Liiders 
projection (|4.1[) . it is easy to see that quantum theory satisfies Postulate [U 

PpP r , s ( PpP \ tr(Pp) 

^ = ^pT) * fp ^^ = tT { P Wpj) = t^j = 1 - 

In the following, we introduce a concept of subspaces. Roughly speaking, a 
subspace is a subset of the set of states which can be treated as a set of states 
in its own right, just as if it would be associated with a different, "smaller" kind 
of system. From a purely mathematical point of view, a subset of a set of states 
only needs to satisfy very little in order to induce the structure of an abstract 
state space. It only needs to be a compact convex subset, as is shown in the 
following proposition. 

Proposition 4.3: For an abstract state space (A, A + , ua) and a compact con- 
vex subset 0,$ C £Ia, the triple (S, S+,us), where S = span(f2s), 5+ = 
cone(fis) (recall Definition 13. 1| and us — ua\si is an abstract state space in 
the sense of Definition 13.191 

Proof. According to Definition I3.19[ we have to check that (a) S is a finite- 
dimensional real topological vector space, (b) S+ is a closed and generating 
cone in S and (c) us is an order unit in S* . (a) is trivial. For (b), recall 
Theorem EHH to see that S+ is closed (trivially, S+ is a cone). S+ is generating 
since S is chosen to be the span of the basis fig. For (c), note that us is the 
restriction of a strictly positive map and therefore is itself a strictly positive 
map. By Theorem 13. 181 us is an order unit in S* . □ 

To simplify the terminology in the following discussion, we define this kind 
of abstract state space, which is induced by the choice of a compact convex 
subset of a set of states as an induced abstract state space. 

Definition 4.4 (Induced abstract state space): For an abstract state space 
(A, A+, ua) and a compact convex subset Cls C CIa, the triple (S, S+, us), where 
S = span($7s), 5+ = cone(fis) and us — ua\s, is referred to as the abstract 
state space S induced by tts C CI a ■ I n this case, we say that the subset Clg 
induces an abstract state space. 

From a mathematical point of view, Proposition 14.31 is a correct statement, 
and there is no reason to forbid Definition 14.41 From a physical point of view, 
however, it is not sensible to consider abstract state spaces induced by arbitrary 
compact convex subsets Cls C Ha as being physical. To see this, we first 
develop an intuitive picture of subspaces, before we specify physical properties 
that we regard as necessary conditions for a "subspace" to be satisfied. Then we 
translate them into the mathematical language of abstract state spaces. Finally, 
we will give some (physical and unphysical) examples. 
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A subspace of a system can be thought of as a subset of the set of states 
that arises from the fact that we are ignorant about some properties that the 
system can have. In other words, a subspace arises in situations where we are 
unaware about the existence of some states of the system. As an example, think 
of a source Q which emits physical systems which seem to be of some particular 
type. We describe this type of system by an abstract state space (S, S + ,us)- It 
might be that our description (S, S+,us) of the system only describes the actual 
physical system partially. This might be caused by the fact that the source Q 
is built in a way which constrains the state of the system. 

source Q 



source Q' 




device D 



device U 



Figure 4.1: A source Q that emits physical systems might be built in a way such 
that it only emits systems in states of a subspace. 



For example, one might think of the source Q as an apparatus which is 
composed of a few smaller apparatuses Q' , D and U (c.f. Figure H3J- The 
apparatus Q' is the actual source of the physical systems. It emits systems of 
some more general kind. Say that we describe this more general kind of system 
by an abstract state space (A, A +1 ua)- The state of a system which leaves 
the source Q' might be in any state uj 6 ft a- After its emission, the system 
enters a device D. This device performs a measurement on the system (the 
measurement has three possible outcomes, say). We might think of the device 
D as a Stern-Gerlach device. Depending on the outcome of the measurement, 
the device sends the system along some path. If the measurement outcome is 
1, the system is directed towards a block where it is absorbed, so in this case, 
the system does not leave the source Q. If the outcome of the measurement is 
2 or 3, the path that the system takes leads to a device U. This device ensures 
that the system leaves the source Q. We might think of this device as a beam 
focussing device which focusses the beam to the output hole of the source. 

If a source Q is built in this way, it prevents some systems from leaving the 
source Q. All systems with a state that certainly leads to the measurement 
result 1 are blocked. An experimenter who performs experiments on systems 
that leave the source Q will never see such a system. He tries to find out 
experimentally what the state space of the system is, the will not find out that 
such a system is described by the abstract state space (A,A + ,ua)- Instead, he 
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describes it by some other abstract state space (S 1 , S+, us), because he only sees 
systems in some set of states Hs- He describes a subspace of the system. 

Consider a quantum example. Suppose the source Q' emits three-level 
quantum systems, i.e. the Hilbert space H is three-dimensional with a ba- 
sis {|1), 1 2), |3)}. The states of the actual system are therefore described by 
Ha = S(H). Assume that the device D performs a measurement with respect 
to the POVM {|1)(1|, |2)(2| + |3)(3|}. If the outcome of the measurement is 1, 
then the system in the state p = |1)(1| is blocked, otherwise leaves the source Q. 
In this case, the experimenter who uses the source Q would describe the states 
of the system by H s = S{W), where W = span({|2), |3»). The subset S(H') 
is a subspace of S(7i). 

Now we describe some properties that we expect from a subspace. We con- 
sider the following physical requirements: 

(a) The states in Hs which correspond to maximal knowledge (i.e. the pure 
states of Hs) are states of maximal knowledge in Ha as well. This is to be 
interpreted as the fact that we cannot gain knowledge by "forgetting about 
the rest of the state space" . 

(b) The set of measurements that can be performed on the subspace arises from 
the convexity structure of the subspace Hs on its own (which allows us to 
consider Hs as a space in its own right), independently of the convex struc- 
ture of the larger set of states Ha containing the subspace. In particular, 
this idea encompasses two requirements: 

(i) The structure of Hs as a subspace of Ha should be compatible with 
the point of view that our ignorance about the larger space Ha does 
not change the fact that actually, we perform a measurement on Ha 
(which is the set of states associated to the type of system in question) 
and not merely on Slg. Therefore, for every measurement Ms on the 
subspace Hs which arises from the convex structure of Hs, there should 
be a measurement Ma on Ha the restriction of which to H$ coincides 
with Ms- 

(ii) Each measurement Ms on the subspace Hs can be performed in a 
way such that the post-measurement state lies in Hs- If this would 
not be the case, we could not regard Hs as being a subspace in its own 
right: A description of consecutive measurements would necessarily 
involve the whole space Ha- This means that for every measurement 
Ms on ^s, the measurement Ma on Ha that induces Ms has a 
face of possible post-measurement states that coincides with the face 
of possible post-measurement states for Ms (this face is given by 
repeatability). 

Now we formulate these requirements in mathematical terms. For a sim- 
plified way of speaking, we call every subset Hs C Ha that satisfies the above 
conditions a physical subspace. A subset Hs C Ha that induces an abstract 
state space but violates one of these principles is called an unphysical subspace. 
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Definition 4.5 (Physical subspace): For an abstract state space (A,A+,ua), a 
compact convex subset Q$ of ^A is a physical subspace of Qa if the following 
conditions are satisfied: 

(a) ext(Q,s) C ext(fl,4.)- 

(b) For every pure measurement A4s = {ei, . . . ,e n } on the abstract state space 
(S, S+,Us) induced by Q$ C CIa, there is a pure measurement AAa = 
{/i, . . . , /„, . . . , /n+fc} (for some k) on (A, A + , ua) such that the following 
properties are satisfied: 

(i) fi\s = ei Vi <E {l,...,n}, 
(ii)F fi =F ei Vie{l,...,n}, 

where F fi = {u e CI a f t (uo) = 1}, F ei = {u> G Cl s ej(w) = 1}. 

A subset Cls C CIa is an unphysical subspace if it induces an abstract state 
space but violates (a) or (b). 

For condition (a) of a physical subspace, we already know a simple sufficient 
criterion: As a corollary of Proposition 12.291 we have that faces satisfy (a) . 

Corollary 4.6: For an abstract state space A, every face Cls of Ha satisfies 
ext(Og) C ext(flA)- 



Proof. By Proposition 12.291 Cls satisfies Cls = conv({w g ext(fiyt) | v £ ^s})- 
This implies ext(f2s) C {v £ ext(f^) | v E Cls} C ext(fU)- □ 

Example 4.7 (Unphysical and physical subspaces): We consider a few examples 
of induced abstract state spaces (c.f. Figure |4~2]) . The first three examples are 
unphysical subspaces. They show that not every compact convex subset of a 
set of states can be regarded as a physical subset. The fourth and fifth example 
are physical subspaces. 






(2) 

Figure 4.2: The first three examples are unphysical subspaces: (1) a circular 
subset of a square, (2) a face of a pentagon, (3) a face of a square. The fourth 
example is a physical subspace: (4) a face of a tetrahedron. 
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(1) A (filled) circular subset of a square is a compact convex subset, but it 
violates the requirement that every extreme point of the subset should cor- 
respond to a pure state of the whole set of states. 

(2) Let £Ia be a regular pentagon (which is the polygon model corresponding to 
n = 5). Suppose that $7$ is a facet of Ha- According to Corollary 14.61 fts 
satisfies condition (a). However, it violates condition (b) (i). To see this, 
note that £ls is a line-segment, i.e. a 1-simplex, so the induced abstract 
state space (S,S + ,us) is a classical theory of two pure states. According 
to Proposition [33U these two pure states can be perfectly distinguished by 
a measurement on S. But in the pentagon model, the two vertices of an 
edge cannot be perfectly distinguished by a measurement on A (as one can 
calculate from (|3.20[) . 

(3) Let VLa be a square (the n — 4 polygon model) . A facet £ls of ft a satisfies 
(a). In contrast to the previous example, it also satisfies condition (b) (i). 
To see this, consider the face F e3 associated with e^ (see Figure [473]) . The 
vertices of this 1-simplex are W2 and u)?,. They can be perfectly distinguished 
by the measurement AAa = {e2,e4J since u>2 £ F &2 and L03 6 F es . But 
5I5 violates (b) (ii). The measurement M.a = {e2,e4J is the only pure 
measurement on A that perfectly distinguishes 102 from W3, and for this 
measurement, F e . 2 ^ {uj 2 }, F ei ^ {w 3 }. 




Figure 4.3: A facet Vt$ of the n — 4 polygon model violates condition (b) (ii) of 
Definition 14. 5 1 so it is not a physical subspace. 

(4) A face of a tetrahedron is an example of a physical subspace. It satisfies all 
the requirements of Definition 14.51 

(5) Let H be a finite-dimensional Hilbert space, let %' C T-L be a subspace of H 
(in the vector space sense). Then S{%') is a physical subspace of S(V.). ■ 

As we have mentioned above, physical subspaces naturally arise in the con- 
text of measurements. It is instructive to discuss this in more generality for 
the example of quantum theory. Consider a quantum system with a finite- 
dimensional Hilbert space 7i, i.e. with £Ia = 5(H). If we perform a projective 
measurement A4 = {Pi}i£i M and obtain the outcome k G Im-, then we know 
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that the state of the system after this measurement is a density operator on the 
image of the projector, i.e. p pos t S S(H') with H' = PkH- On the other hand, 
S(H') is exactly the subset of S(H) which is compatible with the repeatability 
of the measurement (Postulate [1]): For every density operator p £ S(H), we 
have that fp k {p) = tr(Pkp) = 1 if and only if p is a density operator on 'H' , i.e. 
P eS(H'). 

The subspace principle states that this is true for a general abstract state 
space. It combines the concepts of repeatability and subspaces. Repeatability 
requires the post- measurement state of a system to lie in a certain face of Q,a- 
We postulate that this subset is a physical subspace. 

Postulate 2 (Subspace principle): For an abstract state space A, let Q$ be a 
face of 'Qa associated with a pure effect f £ Ea, i-e. Cls = Ff. In other words, 
let fts be a face of Qa such that 

3/ £ Ea pure : /(w) = 1 O w e fi s Vw G CIa- (4.3) 

Then tts * s o, physical subspace according to Definition \4-5\ 

4.3 A state discrimination principle 

In this section, we introduce a state discrimination principle for abstract state 
spaces. It concerns the notion of perfect distinguishability of states. Recall 
from Definition 13.301 that states u>x , . . . , w„ £ Ha Q>re perfectly distinguishable 
if there is a measurement {fi,...,f n } c Ea such that fi(uij) — Sij for all 
i,j £ {l,...,n}. 

Example 4.8 (Perfectly distinguishable states): In order to get a better feeling 
for the notion of perfect distinguishability, we make a few examples. 

(1) In quantum theory, states p\,...,p n £ S(H) are perfectly distinguishable if 
and only if they have support on pairwise orthogonal Hilbert subspaces: 

Pi, . . . , p n are perfectly distinguishable ^ supp(pi) _L supp(,Oj) Vi =^ j . 

In this case, a measurement which perfectly distinguishes between the states 
pi, . . . ,p n would be given by the projectors Pk onto the supports supp(pfe) 
of the density operators! 20 ! 

(2) In the polygon model with n = 3 (c.f. Example I3.28[) . which corresponds 
to a triangle, the pure states u>x,u>2 and W3 are perfectly distinguishable by 
the measurement {d, 62,63}, as is easily verified from (|3.19p and (J3.20I) . 



(3) In Example 14.71 (3), we have seen that the states W2 and W3 of the n = 4 
polygon model are perfectly distinguishable states. I 



_>o 



If Ptotal = SiLi Pk is a projector onto a proper Hilbert subspace of 'H, we add I-n — P\ 



tot 



to one of the projectors to get a measurement. 
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To state the state discrimination principle properly, it is convenient to gen- 
eralize the definition of perfect distinguishability to sets of states. 

Definition 4.9: For an abstract state space A, sets Ai, . . . , A„ C SI a are per- 
fectly distinguishable sets of states if there is a measurement {e±, . . . , e„} 
such that for every i e {1, . . . , n\, we have that 

ei(uj) — 1 Vw G A^ ei(u) = Vw 6 Aj if j ^ i . 

If n = 2, we say that A\ can be perfectly distinguished from A2. 

This definition of perfectly distinguishable sets of states reduces to the Defi- 
nition [3301 OI perfectly distinguishable states in the case where set each Aj only 
contains one state. 

With this definition at hand, we can state the state discrimination principle. 

Postulate 3 (State discrimination principle): Let A be an abstract state space, 
let Ai,A2 C Qa be perfectly distinguishable sets of states. Assume that in ad- 
dition, there are subsets A3,A4 C A2 such that A3 is perfectly distinguishable 
from A4. Then Ai, A3 and A4 are perfectly distinguishable. 

This is a natural assumption. An everyday life example: If I can distinguish 
black hats from colored hats and blue hats from red hats, then I can distinguish 
between black, blue and red hats. The state discrimination principle also holds 
in quantum theory, as can be seen in the following example. 

Example 4.10 (State discrimination principle in quantum theory): Let T-L be 
a four-dimensional Hilbert space. Let Ai = {p\,p2\ and A2 = {ps, p 4 } be 
perfectly distinguishable sets of states. In quantum theory, this means that 

span(supp(/9i), supp(/9 2 )) -L span(supp(/9 3 ), supp(/9 4 )) • 

Suppose that {P3}, {/0 4 } C A 2 are perfectly distinguishable sets of states, i.e. p 3 
and pi are perfectly distinguishable states. This means that 

supp(p 3 ) _L supp( j o 4 ) ■ (4.4) 

Then we have that 

supp(p 3 ) C span(supp(p 3 ), supp(p 4 )) -L span(supp(pi), supp(p 2 )) 

=> supp(/9 3 ) _L span(supp(pi), supp(p 2 )) , (4.5) 

supp(p 4 ) c span(supp(p 3 ), supp(p 4 )) _L span(supp(pi), supp(p 2 )) ■ 

=> supp(p 4 ) _L span(supp(pi), supp(p 2 )) • (4.6) 

Quantum theory satisfies the state discrimination principle: Equations (|4.4[) . 
(J4.5I) and (|4.6p together imply that A2, {^3} and {p 4 } are perfectly distinguish- 
able sets of states. ■ 
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While classical and quantum theory satisfy Postulate [31 there are theories 
in which this is not the case, as the following example shows. 

Example 4.11 (Violation of the state discrimination principle in the n = 4 
polygon model): Once again, we consider the case where H,a is a square. Con- 
sider the sets Ai = {wi,W2} and A2 = {0)3, W4}. They can be perfectly distin- 
guished by the measurement {e2,e4J (see Figure I4T41 . Let A 3 and A 4 be the 
subsets A 3 = {w 3 },A4 = {^4} C A 2 . They can be perfectly distinguished by 
the measurement {ei,es}. If the square would satisfy the state discrimination 
principle, then the sets {wi,oj2}, {^3} and {0J4} would be perfectly distinguish- 
able. In particular, the states 1^2, ^3 and uja would be perfectly distinguishable. 
However, there is no measurement on the square which distinguishes states. 




Figure 4.4: The square violates the state discrimination principle. 



4.4 Result: Classical theory derived from polytopic the- 
ory and three postulates 

In this section, we prove the first result of this thesis. It states that every 
polytopic theory which satisfies Postulates [TJ [2] and [3] is a classical theory. Recall 
from Examples 13.291 and 13.321 that an abstract state space is a polytopic theory 
if VIa is a polytope and a classical theory if VLa is a simplex. 

As a first step towards the result, we prove that if A is a polytopic theory 
that satisfies Postulate [2] then for every face F of VLa, there is a pure effect 
/ G Ea such that the face Ff associated with / coincides with F, i.e. /(w) = 1 
if and only if ui £ F. Note that without Postulate^ this is not true in general. 
We know from Proposition 13.411 that for a polytopic theory, the statement is 
always true if F is a facet, but as we have mentioned on page [57] in Section 13.31 
in the case where F is a face, this is not true in general. 

Lemma 4.12: Let Abe a polytopic theory which satisfies Postulate^ Then, 
for every face F of Ha, there is a pure effect / G Ea such that for every u G Ha, 
we have that /(w) = 1 <=> cj G F, i.e. F = Ff. 
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Proof. We prove this Lemma by induction over the dimension of the face (c.f. 
Definition 12. 32p . Say that £Ia is a d-polytope. 

• Base case: F is a (d — l)-face 

If F is a (d— l)-face, then F is a facet of Qa- We have proved in Proposition 
13.411 that in this case, there is a pure effect / £ Ea such F = Ff. 

• Inductive step: the case where F is a (d— /c)-face (for some k > 2) 

Assume that for every (d— fc+l)-face G, there is a linear functional g 6 _E^ 
such that G = F g (induction hypothesis). Let F be a (d — &)-face of 57,4. 
Our goal is to show that there is a pure effect / £ Ea such that Ff = F. 

By Proposition 12.391 there is a sequence Qa D Fa d Fd-i D . . . D 
-Fd-fc+i D -Frf_fc = -F of faces of VLa such that i^_i is a facet of i^ for 
alii 6 {d, ...,d — k + i}. 







F d -2 - Fg 



Figure 4.5: This figure shows the sequence Qa D Fd D Fd-\ D ■ ■ ■ D Fd-k+i 3 
Fd-k = F for the case where SIa is a tetrahedron (d = 3) and where F is a 

vertex. 

Fd-k+i is a (d — k + l)-face of 51^, so by the induction hypothesis, there is 
a pure effect g £ Ea such that Fd-k+i = F g , By virtue of Postulate^ this 
implies that Fd-k+i is a physical subspace of £Ia- This means that the 
abstract state space (S = span(F d _ fe+1 ),5 + = conc(F (i _/ £+1 ),U5 = ua\s) 
induced by Qs = -Fk-fc+i is an abstract state space in its own right. 
F — Fd-k is a facet of fig. By Proposition l3.41[ there is pure effect e £ E$ 
on the physical subspace S such that F e = {u £ Q$ I s(u>) = 1} = F. 
Proposition 13.331 tells us that the complementary effect e = us — e £ Es 
is pure as well, so {e,e} is a pure measurement on S. 

As we have mentioned above, Qs = Fk—i is a physical subspace by Pos- 
tulate [21 By the definition of a physical subspace, there is a pure mea- 
surement {/, /2, . . . , fi} such that (in particular) Ff = {uj £ fl A \ f(oj) — 
1} = F e = {to £ Q s | e(w) = 1} = F. This proves the claim. □ 

Lemma 14.121 allows us to prove the result of this section. 

Theorem 4.13: Let A be a polytopic theory which satisfies Postulates [TJ [5] and 
[3J Then A is a classical theory. 
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Proof. Say that SIa is a d-polytope. The idea is to prove the theorem by the 
following four steps. 

(i) At first, we show that there is a sequence SIa D F D Fl D . . . D Fd of 
faces of fi^, where F i+i is a facet of Fj for all z € {0, . . . , d— 1}, such that 
the last face consists of only one point, Fd = {w}. 

(ii) From this sequence, we construct a sequence Fi, . . . , Fd,Fd of perfectly 
distinguishable sets of states. 

(iii) From this sequence in turn, we construct (d+1) perfectly distinguishable 
states wi, . . . , u>di 2. 

(iv) Finally, we show that this implies that SIa is a simplex. 

Now we prove each of the steps. 

(i) Let LJd G SI a be a pure state. In the language of convex sets, cud is an 
extreme point, so {uid} is a face of SIa- By Proposition 12.391 there is a 
sequence SI a D Fq D Fi D . . . D Fd of faces of SI a, where F,+i is a facet 
of F for all i G {0, . . . , d — 1}, such that Fd = {wd}- 

(ii) Fl is a facet of fi^ . This allows us to apply Proposition 13.411 to see that 
there is a pure effect /i 6 F^ such that the face Ff 1 associated with /i 
coincides with F, i.e. Fl = Ff 1 = {ui G fi^ | /i(w) = 1}. By Proposition 
I3.33[ we have that the complementary effect /i := ua — /i is a pure 
effect. From Corollary 13. 37) we get that there is a face Fj- such that 
Fy- = {oj € SI a | /i( w ) = !}• By Definition 13.381 we call this the opposite 
face Fi := Fj x — Fj-. The effects {/i, f{\ form a measurement, so 

/i(w) = i VwgFi, /i(«) = V^eF, 

7i(o;) = VwGFi, 7i(^) = l VweF. 

Therefore, 

the sets Fi,Fi C f^ are perfectly distinguishable. (4.7) 

Fi = F/j is a face associated with a pure effect, so by Postulate [2] 
fi s = Fi is a physical subspace. Consider the abstract state space (S 1 = 
span(Og), S+ = A + flS^iij = u^ls) induced by fl s = Fi. F 2 is a facet 
of fig, so we can apply Proposition 13.411 to see that there is a pure effect 
e 2 G F s such that F 2 = F e2 = {w G fi s | e 2 (w) = 1}- Like before, we 
can use Proposition 13.331 and Corollary 13.371 to get a pure measurement 
{e 2 ,e2~} on S 1 and a face F 2 of f2 s such that F 2 and F 2 are perfectly 
distinguishable: 

e 2 (w) = 1 Vo; e F 2 , e 2 (w) =0 Vw G F 2 , 

e2(w) =0 Vo; e F 2 , e 2 (w) = 1 Vw e F 2 . 
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Qg is a physical subspace, so there is a pure measurement {51, 52, • • ■ , 32+fe} C 
Ea (for some fc) on A such that <7i|gi = e 2 , 32 Is 1 =62 and F ffl = {w £ 
Q^ I <7i(w) = 1} = F e2 = {w e Uj e 2 (w) = 1} = i<2 and likewise 
_F ff2 = Fg^. Define fa = gi, fa = 2j=2 ft- We have constructed a mea- 
surement {/2, /2J on A such that 

/a(w) = l VweF 2 , / 2 (w) = Vwei^, 

7£(w) = Va; e F 2 , / 2 (w) = 1 Vw e 7%. 



Thus, 



the sets F 2 ,F 2 C i*\ are perfectly distinguishable. (4.8) 



Now we combine (14.71) and (|4.8[) and use Postulate [3] to see that i*i , F% , F2 
are perfectly distinguishable sets of states. 

F3 is a facet of F 2 . By Proposition 12.81 F3 is a face of fi^- We have 
shown in Lemma 14.121 that this implies that we have a pure effect fa G Ea 
such that F3 = Ff 3 . By completely analogous reasoning as above, we get 
perfectly distinguishable faces Fi,F2,Fa,Fa of Qa- We iterate this process 
until we get perfectly distinguishable faces i*i, . . . , Fd, Fd = {w}. 

(iii) This step is easy. We simply have to choose some state uj t £ Fi for every 
i e {1, . . . , d} to get (d + 1) perfectly distinguishable states wi, . . . , u>d, w. 

(iv) The states Wi, . . . , cj^, w are perfectly distinguishable, so by Proposition l3.31l 
J7^ is a d-simplex. Therefore, A is a classical theory. □ 



82 



5 Main result: One simple postulate implies that 
every polytopic state space is classical 

This section is dedicated to the main result of this thesis. It states that every 
polytopic theory which satisfies a very simple postulate is a classical theory. 
The postulate claims that if the outcome of a measurement can be predicted 
in advance with certainty, then performing this measurement does not disturb 
the state. Although this postulate looks very weak, we show that it is strong 
enough to rule out all polytopic theories except for classical theory. 

In Section 15. 1[ we recapitulate the concept of transformations and opera- 
tions. We will finish our definition of the framework of abstract state spaces by 
making the assumption that every pure measurement has an associated opera- 
tion. As we will discuss, our postulate then becomes a preservation principle for 
the transformations associated with pure measurements. Then, we will examine 
some consequences of this principle by considering the examples of the triangle, 
the square and the pentagon. We will prove the main result in Section 15.21 
At first, we will show that every polytopic theory satisfying the preservation 
principle has a certain property. We will say that such a polytope is uniformly 
pyramidal. Finally, we show that every uniformly pyramidal polytope is a sim- 
plex, and therefore the theory in question is classical. 

5.1 Transformations and a preservation principle 

In section 13. 6[ we have seen how transformations naturally arise in the con- 
text of measurements. We have defined a transformation T to be a linear map 
T : A — > B which is positive and does not increase the norm. However, we did 
not make an explicit assumption about how measurements are related to trans- 
formations. The reason why we did not make such an assumption in Section l3~6l 
already is that we did not know at that point that we have to make the restric- 
tion to pure measurements. In Section 14. 1\ we have argued that whenever we 
make statements about post-measurement states, we have to restrict ourselves 
to pure measurements. Now that we know about this restriction, we resume our 
discussion of transformations. 

We have discussed in section l3~6l that measurements encompass two aspects: 
the statistic of the measurement and the disturbance of the statistic of subse- 
quent measurements. The first aspect is covered by treating a measurement as 
a set of effects. In Section HI we technically did not make explicit statements 
about how a post-measurement state should look like. The only principle which 
explicitly said something about post-measurement states was Postulate [TJ but 
technically, we did not assume it. Instead, the idea of repeatability was, to some 
extend, implicitly contained in Postulated] The fact that we did not explicitly 
deal with post-measurement states allowed us to treat measurements as sets of 
effects. We did not have to deal with transformations. In this section, however, 
we do make explicit statements about post-measurement states, so we have 
to set up how transformations are related to measurements in our framework. 
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We do this by the following assumption. Since this assumption is part of the 
framework, we state it as an "Assumption" rather than as a "Postulate" . 

Assumption 5: When a pure measurement M. = {ei,...,e n } is performed 
on an abstract state space A, then the state of the system is transformed ac- 
cording to an operation Om — {T ei , • ■ • , T ek } from A to A which induces the 
measurement M. (recall from Definition | ff.^ffl that this means e^ = (ua ° T ek ) 
for all k E {1, . . . , n}). We call this the operation Om associated with the 
pure measurement M, and for every k £ {1, . . . ,n}, we say that T ek is the 
transformation associated with the pure effect e&. 

This fixes our framework. Now we are ready to explain the postulate of this 
section. The idea is the following. Suppose that a physical system has some 
property. When we say property, we mean something that we can find out by 
a measurement. Moreover, when we say that the system has the property, we 
mean that the system is in a state such that the measurement outcome that 
reveals this property has probability one, i.e. we can predict the outcome of 
the measurement with certainty. It is natural to assume that in this case, we 
can perform the measurement without altering the state. In formal terms, this 
reads as follows. 

Postulate 4: Let A be an abstract state space, let M. = {ex, . . . , e„} be a pure 
measurement on A and let Om = {T ei , ■ • ■ , T Bn } be the associated operation. If 
u £ £Ia is a state with a certain outcome, i.e. efc(w) = 1 for some k G {1, . . . , n}, 
then the associated transformation T ek satisfies T ek (uj) — u), i.e. 

(ua oT ek ){tjj) = e k (uj) = 1 => T ek (ui)=uj. 

In other words, each transformation T ek €E Om leaves the face F ek associated 
with the effect e^ invariant. 

Quantum theory obeys this postulate. Consider a spin-i particle that is 
heading towards a Stern- Gerlach device which measures the spin of the particle 
with respect to the z-axis. We can predict the outcome of the measurement with 
certainty if the spin state of the particle is either p — \ tz)(tz | or p = \\. z ){\. z |- 
For example, if the state is p = \ tzXtz L then it is certain that the Stern- 
Gerlach measurement reveals the outcome "up" (which is associated to the 
projector Pi — \ tz)(tz |) since tr(Pip) = 1. In this case, the von Neumann- 
Luders projection (|4.ip does not alter the state: 

P^pP^ 
Ppost = t^pjp) = P ■ 

Classical theory satisfies Postulate |4] as well. It is instructive to see this 
geometrically. We examine the case of a classical theory with three pure states. 

Example 5.1 (Postulate [4] in a classical theory): Consider the polygon model 
corresponding to n = 3. In this model, Qa is a triangle and therefore a simplex, 
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so it is a classical theory. Consider the pure effect e 2 = ua 
-Fej associated with ejj is an edge of the triangle (see Figure [5TT 



62- The face 
The effect e2 



is contained in the pure measurement {e2,e2J. According to Postulate HJ the 
transformation T^- associated with ej leaves the face F^ invariant, i.e. 



J-e^ \^e 2 ) — ^e; 



(5.1) 



On the other hand, the effect €2 vanishes on the face F^ — F &2 opposite to F^. 
The transformation T-^ induces the effect S2, i.e. e2 = {ua °^W) (c.f. Definition 



13.431 and Assumption [5]) . Thus, u J 4(T^-(w)) 
that 



for all u G F e , . This means 



T^(F e2 ) = {0}, 



(5.2) 



i.e. the face F e2 opposite to F^ is mapped to the zero vector. The transfor- 
mation Tgj- is linear, so Equations (|5.1|) and (|5.2|l fully determine T^- : ^4 — >• A 
since dim(span(Fej)) + dim(span(F e2 )) = 3 = dim(A). 



W 





Figure 5.1: The transformation T— maps the set VLa (the gray triangle on the 
left) into the set conv(Fgj U {0}) (the gray triangle on the right). 



For the understanding of the proof idea in Section [521 h is worth visualizing 
the action of the map T^. It maps the triangle VLa to the triangle conv(Fe 2 -U{0}) 
(gray regions in Figure I5TTJ) , while the edge F^ is fixed. Therefore, it maps 
Qa to a subset of fi-j (the upside down tetrahedron on the right of Figure 
15. ip . so it is a positive map which does not increase the norm (cf. Definition 
I3.43[) . Thus, Tgj is indeed a transformation. We can say that for every face F e 
associated to a pure effect e, Postulate 0] says that the associated transformation 
T e maps the set conv(F e U F e ) into the set conv(F e U {0}). In the above case, 
conv(F e Ul;) = conv(i% U F e2 ) = fl A - ■ 

As we have seen, classical theory and quantum theory satisfy Postulate SJ 
In contrast, all polytopic theories which are not classical (i.e. where the set of 
normalized states $7^ is a polytope but not a simplex) violate this postulate, as 
we will prove in the next Section. 

Before we prove this result in full generality, we investigate some conse- 
quences of Postulate [4] by examining two examples. This helps to understand 
Postulate H] geometrically. 
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Example 5.2 (Violation of Postulate 2] in the square and the pentagon model): 
Once again, we consider the polygon models that we have already encountered 
in Examples l3. 281 |3~MI 14.71 and !4. Ill We show that the square and the pentagon 
violate Postulate 0J 

(a) Consider the polygon model corresponding to n = 4 (the square). As we 
have mentioned in earlier examples, the face F ei associated with the pure 
effect ei is an edge of the square (see Figure I5T2")) . The pure effect ei is 
contained in the pure measurement {ei, 63}. Assume that Postulate@]holds 
(we will see below that this leads to a contradiction) . This implies that the 
transformation T ei associated with e\ leaves the face F ei invariant: 



-* ei \-T ei ) F e 



(5.3) 



On the other hand, the effect e\ vanishes on the face F ei opposite to F ei . 
The transformation T ei induces the effect ei, i.e. e\ — (ua ° T ei ). Thus, 
UA(T ei (ui)) = for all u G F ei . This means that 

T ei (iQ = {0}, (5.4) 



i.e. the opposite face F ei is mapped to the zero vector. This leads to a 
contradiction: Since T ei is linear, the equations (|5.3[) and (|5.4p cannot be 
satisfied at the same time. Equation (|5.3p specifies T ei on a two-dimensional 
subspace (namely span(i 7 ' ei )). The transformation T ei : A — > A is a map on 
a three-dimensional space A, so the linearity of T ei only leaves the freedom 
to specify the image T ei (w) for one point u outside of span(i 7 ' ei ). It is not 
possible to map the whole edge F ei to zero (see Figure 1531) . 




TeAF ei ) 



Figure 5.2: The equations (|5.3[) and (15.41) cannot be satisfied at the same time. 
If F ei is left invariant, then the edge F ei cannot be mapped to zero. Only one 
point of F ei can be mapped to zero, which is chosen to be ^2 + 2 W 3 here. 

This example shows that if dim(span(i 7 ' e )) + dim(span(i 7 ' e )) = 4 > dim(A) 
for some pure effect e s Ea, then Postulate |4] cannot be satisfied. 

(b) Now we consider the regular pentagon, i.e. the n — 5 polygon model. As 
we have seen in Example 13.391 the face F e3 associated with the pure effect 
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e 3 is an edge of the pentagon (see Figure 153)) . The opposite face F ea = F e . 
is a point, so in the case of the pentagon, one has that dim(span(_F , e3 )) ^ 



dim(span(i 7 ' e3 )) = 3 



dim(A). Thus, the conditions 
T^(F e3 ) = {0} 



(5.5) 
(5.6) 



do not lead to a contradiction. They can both be satisfied at the same time. 
However, the pentagon also violates Postulate 21 By the linearity of the 
transformation Tgj, the equations (|5.5j) and ()5.6[) fully determine T^. As 
one can see in Figure [5T31 T-^ maps FIa to some set T^(Q,a) which is not 
contained in fl^ (and therefore it is not contained in A+). This means 
that Tgj is not a transformation according to Definition 13.431 It maps the 
set VLa C A + to a set which is not contained in A + , so T-^ is not positive. 





Figure 5.3: For the pure effect ejj in the pentagon model, there is a linear map 
T^- which satisfies T-^{F e A = F e3 and T-^{F e3 ) = 0. However, this map is not 
a transformation, since T-^(JIa) (dotted lines) is not contained in A + , so T^ is 
not positive. ■ 



Example [521 shows two things that can lead to a contradiction to Postulate HI 
In the square example (a), we have seen that it might be that there is some pure 
effect e € Ea such that T e (F e ) = F e and T e (F e ) — {0} cannot be satisfied at 
the same time. In the pentagon example (b), the equations T e (F e ) = F e and 
T e {F e ) — {0} can both be satisfied, but the only map which satisfies these two 
equations is not positive. In Section 1531 we will see that any polytopic theory 
cither has one of these two problems or it is a simplex. 

With a little thought, one can already guess from example 15. 21 (b) why every 
polygon £Ia that satisfies Postulate|4]must be a simplex (i.e. a triangle). We only 
make a very rough argumentation here. Assume the case where T e (F e ) = F e and 
T e (F e ) = {0} can both be satisfied. For the map T e to be positive, it has to fit 
Qa into conv(F e U {0}). The map T e : A — > A is not bijective since T e (F e ) = 0, 
but the restriction of T e to f2^ is an affine bijection, so one can visualize that 
T e (fiA) C conv(F e U {0}) can only be satisfied if Ha — conv(F e U F e ). In this 
case, £Ia is a triangle because F e is a facet of ft a, i-e. a line-segment, and F e is 
a point. 
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5.2 The Result 

In this section, we prove that every polytopic theory which satisfies Postulate 0] 
is a classical theory. In other words, if A is an abstract state space such that SIa 
is a polytope and such that Postulate 0] is satisfied, then SIa is a simplex. We 
prove this result in two main steps. In the first step, we define what it means for 
a polytope to be uniformly pyramidal. We prove that for a polytopic theory A 
which satisfies Postulate 01 the polytope SI a is uniformly pyramidal. This is the 
"physical" part of the proof. The second part is the proof that every uniformly 
pyramidal polytope is a simplex. 

Definition 5.3: A polytope P with a facet B is called pyramidal at B if P 

is a pyramid with base B and some apex as, i.e. P = conv(B U {as}) (c.f 
Example \2.4(ty - A polytope is uniformly pyramidal if it is pyramidal at every 
facet. 

It will turn out that simplices are the only polytopes that are uniformly 
pyramidal, so they are the only example that we can make. The property of 
being uniformly pyramidal is easily visualized on a polyhedron (see Figure [ 




Figure 5.4: The tetrahedron is a uniformly pyramidal polytope. Each face is 
a base of the tetrahedron as a pyramid, where the apex is given by the point 
opposite to the face. 

The first step is to prove that every polytopic theory A which satisfies Pos- 
tulate 0] has a uniformly pyramidal set of states SI a- Before we prove this, it is 
convenient to introduce the following notation. 

Notation: For a subset M C SIa of the set of normalized states, we define 
M^ 1 := {cue A + \3a e M, a € [0, 1] : u = aa}. 

Lemma 5.4: Let A be a polytopic theory which satisfies Postulate 0J Then 
SIa is uniformly pyramidal. 

Proof. We prove this lemma in three steps, 
(i) In the first step, we consider an arbitrary facet F of SIa- We know from 



Proposition 13.411 that there is a pure effect / G Ea such that F — Ff. We 
show that the opposite face F :— Fj (c.f. Definition 13. 38|) is a vertex of 

the polytope, i.e. F = {a_p} for some ap G Ha- 

(ii) In the second step, we consider the transformation Tf associated with the 
effect /. We show that the restriction of Tf to conv(FU{a,F}) is a bijection 
from conv(F U {a F }) to F^ 1 . 

(hi) Finally, we show that this implies that Ha = conv(FU{aF})- Since F is an 
arbitrary facet, this is sufficient to show that Ha is uniformly pyramidal. 

Now we prove the three steps. 

(i) Let F be a facet of Ha- According to Proposition 13.411 there is a pure 
effect / G Ea such that F = Ff := {uj e Ha | /(w) = 1}. By Proposition 
13.331 the complementary effect / := ua — f is pure as well, so we have that 
/ is contained in the pure measurement M = {/, /}. By Assumption [5j 
M. is associated with an operation Ojvt — {Tf,Tj} which induces M.. 
Postulate 2] tells us that that the transformation Tf satisfies Tf(u>) = uj for 
all uj 6 F. By the linearity of Tf, we have that Tf restricted on span(F) 
is the identity map on span(_F): 

2/lspan(F) = 4pan(F) ■ (5-7) 

The effect / is pure, so by Corollary 13.371 there is a non-empty face F 
such that F = Fj := {uj G Ha | /(w) = 1} (in Definition 13. 38| we called 
this the face F opposite to F). We want to show that the restriction of Tf 
to span(i^) is the zero-map. 

The effects M. = {/, /} form a measurement, so the fact that / takes the 
value 1 on F implies that / takes the value on F since (/ + /)(w) = 
ua(oj) = 1 for all u G Ha- The transformation Tf induces the effect / (by 
Assumption [5]) , so f(ui) — UA(Tf(u>)) = for all uj G F. This implies that 
Tf(oS) =0e Q.2 1S the zero-state for all uj G F. By the linearity of Tf, 
this implies that the restriction of Tf to span(i ;l ) is the zero-map: 

^/Ispan(F) = O s pan(F) ■ ( 5 '^) 

Our assumptions require that both (|5.7p and (15.8^ are satisfied. This is 
only possible if span(F) n span(F) = {0}. Let d — dim A. Then Ha is 
a (d — l)-polytope. We have assumed that F is a facet of Ha, so aff (F) 
is (d — 2)-dimensional. Thus, dim(span(i 7 ')) = d — 1. To fulfill span(F) n 
span(F) = {0}, span(F) must be one-dimensional. This implies that F 
consists of only one point which we call of, i.e. F — {ap}. 

(ii) The transformation Tf is linear. By Theorem l2.191 this implies that Tf is 
an affine map. Thus, the restriction T/| a ft(Fu{a F }) 0I Tf to &S(F U {o_p}) 
is an affine map. We have seen in Proposition 12.211 that an affine map 



is injective if it maps affinely independent points to affinely independent 
points. This is the case for Tf\ a g( FU { aF yy. Equation (|5.7p implies that 
Tf\ag(F) — -^aff(F)i an d &f (which is affinely independent of aff(-F) by 
Proposition 12.101) is mapped to the zero vector (which is affinely inde- 
pendent of Tf(a,S(F)) — aif(.F)). Thus, T/| aff ( F u{a F }) is an affine in- 
jection. We have that conv(F U {op}) C a.ff(F U {of}), which implies 
that 2/| conv (/? U { aF }) is an injection. The map Tf is affine, so by Propo- 
sition H2D1 one has that T/(conv(F U {a F })) = conv(T / (F U {a F })) = 
conv(T/(F) U {T> (a F )}) = conv(F U {0}) = F^ 1 . All in all, we have 

X)(conv(FU{a F })) = F- 1 , (5.9) 

TfU iF u{a F }) ■ aff(FU {a F }) -»• atf^ 1 ) is bijective . (5.10) 

(iii) The set of normalized states £Ia is a polytope, so £1^ = conv(fi>i U {0}) is 
a pyramid (c.f. Example 12 .40|) . The set F is a facet of £Ia, so we get from 
Lemma H2H that F^ 1 = conv(F U {0}) is a facet of Q|\ By Proposition 
12.101 this implies that 

f^ 1 = &&(F^ 1 )nnf. (5.ii) 

Assume that there is a a G f2^, er ^ conv(F U {a F }). The proof is finished 
if we manage to show that this is impossible (since this implies conv(F U 
{a F }) C Ha, and conv(FU{a F }) C £Ia is trivial). The subset F is a facet 
of £Ia (i.e. F is of one dimension less than VLa) and a F G £1^ is affinely 
independent of F (by Proposition l2~T0|) . so aff(FU{a F }) = aff(il^). Thus, 
a G Qa implies 

o-eaff(FU{a F }). (5.12) 

Note that for any subset M of a vector space, one has that 

aff (M) = aff (conv(M)) . (5.13) 

From (|5.12[) . we get that 

T f (a) G T f (afi{F U {a F })) V 1 X)(aff(conv(F U {a F }))) . (5.14) 
The map Tf is affine, so by Proposition ^. 20[ we have that 
T)(aff(conv(F U {a F }))) = aff(T/(conv(F U {a F }))) 

'Paff^ 1 ). 
Equations (|5.14[) and (|5.15[) imply 

XH^Gaff^ 1 ). (5.16) 
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(5.15) 



The map Tj : A — > A is a transformation (c.f. Definition I3.43p . so a G Ha 
implies 

T f (a)enf. (5.17) 

([5Tf| . ([536]) and (f5TTj) imply that 

Wef- 1 . (5.18) 

Now we combine (j51?|) . ((STTUl) . ([5TT2]) and ([STT51) to see that a G conv(F U 
{aj?}). This is a contradiction to the assumption that a ^ conv(FU{aF})- 
This completes the proof. □ 

We have just done the first of the two main steps to prove our main result. 
The second step is the following lemma. 

Lemma 5.5: Every uniformly pyramidal polytope is a simplex. 

Proof. We prove this lemma by induction over the dimension of the polytope. 

• Base case: P is a uniformly pyramidal 1-polytope 

This case is trivial. Every 1-polytope is a 1-simplex. 

• Inductive step: P is a uniformly pyramidal <i-polytope 

Let d G N,d > 2 and assume that every uniformly pyramidal (d — 1)- 
polytope is a simplex (induction hypothesis). Let P be a uniformly pyra- 
midal d-polytope, let B C P be a facet of P. We show that B is a simplex. 
By the definition of a simplex (c.f. Example l2.33|) . this is sufficient to show 
that P is a simplex since the apex is as is affinely independent of B. 

The polytope P is uniformly pyramidal. Therefore, there is an apex a^ G 
P such that P = conv(_BU{as}). This implies that the number of vertices 
of P is n + 1, where n is the number of vertices of B. Let F C B be a 
facet of B. By Lemma I2~i2l G := conv(F U {a B }) is a facet of P. The 
polytope P is uniformly pyramidal, so P — conv(G U {ag}) for some 
«G G P. P has n+1 vertices, so G has n vertices. Thus, F has n — 1 
vertices, which means that B has only one more vertex than F. This 
implies that B = conv(F U {of}) for some of G -B. The set ,F is an 
arbitrary face of B, so we have just shown that B (which is a (d — 1)- 
polytope) is uniformly pyramidal. By the induction hypothesis, every 
uniformly pyramidal (d — l)-polytope (in particular B) is a simplex. □ 

Theorem 5.6 (Main Result): Let A be a polytopic theory that satisfies Pos- 
tulate [3J Then A is a classical theory. 



Proof. This theorem is a corollary of Lemma 15.41 and Lemma 15.51 □ 
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6 Conclusion and Outlook 

In the first part of this thesis, we introduced the mathematics of convex sets, 
polytopes and generalized probabilistic theories, and we provided a derivation 
of the framework of abstract state spaces. To our knowledge, such a derivation 
is new. Along this introduction and framework derivation, we developed most 
of the techniques that we used to infer the results of Part|TI]of this thesis. 

In Sectional we have seen that within poly topic theories, we can character- 
ize classical theory by Postulates Q] [2] and [3l i.e. by repeatability, a subspace 
principle and a state discrimination principle. To our knowledge, such an inter- 
pretation has not been made before. From a technical viewpoint, the inside into 
the mathematics of abstract state spaces gained by the proof in Section 14.41 is 
arguably small since, in particular, Postulate [5] is quite strong. From a physical 
viewpoint, however, the ideas presented in Section [4] could provide interesting 
ideas in future attempts to infer quantum theory from physical postulates (c.f. 
Section [Ol and the beginning of Part [TT|) . 

In Section [5j we have seen that the very reasonable and seemingly weak 
Postulate H] is in fact strong enough to rule out all polytopic theories except for 
classical theories. This might be an interesting starting point in future attempts 
to derive quantum theory from physical assumptions. In particular, it is a new 
approach insofar as to our knowledge, there has not been any consideration 
of post-measurement states in generalized probabilistic theories so far. Within 
polytopic theories, we have seen that it is sufficient to assume Postulate 2] alone 
to rule out "unreasonable theories" (assuming that the only reasonable theories 
are classical theory and quantum theory). In a more general class of theories, it is 
presumably necessary to assume a few more postulates to rule out "unreasonable 
theories" . This is where Postulates [TJ [2] and [3] might come into play. 

However, from the technical side, if one wants to generalize the results of this 
thesis to broader classes of abstract state spaces (i.e. to non-polytopic theories) , 
one needs to develop other techniques than those presented in this thesis. Our 
derivations rely quite strongly on the fact that we are dealing with polytopes. 
For example, the concept of a facet of a polytope is very central in our deriva- 
tions. In general, convex sets do not have something like a facet. The largest 
proper face of the Bloch sphere, for example, is a single point and therefore a 
zero-dimensional face of a two-dimensional convex set. All our techniques in- 
volving facets do not apply in this case. Nonetheless, it might be that the same 
Postulates, by the use of other techniques, have strong implications in broader 
classes of generalized probabilistic theories. 
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Appendix 

A Compact convex sets in infinite-dimensional 
topological vector spaces 

In Section l2~3l we have seen that by a theorem of Minkowski (Theorem l2.14[) . ev- 
ery compact convex subset of a finite-dimensional vector space is the convex hull 
of its extreme points. This theorem can be generalized to infinite-dimensional 
topological vector spaces, where it is known as the Krein-Milman Theorem. 
However, in that case, the statement is weaker: one needs to take the closure. 

Theorem A.l (Krein-Milman, see [Wer071 Theorem VIII.4.4]): Let V be a 
locally convex Hausdorff topological vector space, let C be a compact convex 
subset of V. Then C is the closure of the convex hull if its extreme points, 



C = conv(ext(C)) 



The Krein-Milman Theorem allows us to consider polytopes in a more gen- 
eral setting. In Section 12.41 we have defined polytopes as compact convex sub- 
sets of /mite-dimensional vector spaces with finitely many extreme points. It 
is very convenient to assume that the vector space is finite-dimensional since 
every finite-dimensional real vector space is essentially the same as M. n for some 
n. As we mentioned in Section 12.31 there is only one (Hausdorff) topology in 
finite dimensions. Now we want to turn to the question how restrictive it is to 
assume that the vector space is finite-dimensional. It turns out that as long as 
we stick to "well-behaved" topologies, this assumption does not cause a loss of 
generality. When we say "well-behaved" , we mean a locally convex Hausdorff 
topology. The following proposition states this formally. 

Proposition A. 2: Let P be a compact convex subset of a Hausdorff locally 
convex topological vector space V with finitely many extreme points. Then P 
is contained in a finite-dimensional subspace of V. 



Proof. By the Krein-Milman Theorem IA.1[ P = conv(ext(P)). Obviously, 
conv(ext(P)) C span(ext(P)). We have assumed that ext(P) is finite, so 
span(ext(P)) is a finite-dimensional subspace of V . Finite-dimensional sub- 
spaces of locally convex Hausdorff spaces are always closed [Wer07t Aufgabe 
VIII. 6. 5 (b)], so P C span(ext(P)) which is finite-dimensional. □ 

Thus, defining polytopes as subsets of finite-dimensional vector spaces causes 
no loss of generality (as long as we ignore topologies which are not Hausdorff 
and locally convex). This has an interesting consequence for our framework. In 
Section 13721 we have made Assumption [2] which states that the vector space con- 
taining the convex subset of normalized states is finite-dimensional. In Sections 
[Hand El we have considered polytopic theories, i.e. theories with finitely many 
pure states. Proposition I A . 2 1 allows us to weaken our assumptions when we deal 
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with theories with only finitely many pure states: If we make Assumptions Q] 
and[3l we have that the set of normalized states of a theory is a compact convex 
subset of a real vector space. When we restrict to theories with finitely many 
pure states, we do not have to make Assumption [2] since Proposition IA. 21 tells 
us that such a theory always takes place in a finite-dimensional subspace of a 
real vector space. This is interesting insofar as pure states have quite a direct 
physical interpretation (c.f. Section [3. 5p . whereas the physical interpretation of 
the dimension of the vector space is not so clear. 

B The equivalence of compact convex sets and 
abstract state spaces 



In Section r3.2[ we have listed three assumptions concerning the set of normalized 
states. These three assumptions state that the set of normalized states is a 
compact convex subset of a finite-dimensional vector space. Then we have 
sketched how such a compact convex set gives rise to an abstract state space. We 
mentioned that there is a one-to-one correspondence between compact convex 
subsets of finite-dimensional vector spaces and abstract state spaces. In this 
appendix, we formulate and prove this correspondence mathematically. This 
needs some preparation. 

At first we need to know when two cones are equivalent from the viewpoint 
of ordered vector spaces. Such an equivalence is established by an "isomorphism 
in the cone sense" . Such an isomorphism is called an order isomorphism and is 
defined as follows. 

Definition B.l: Let V and W be two ordered vector spaces. A linear operator 
4> : V — y W is called an order-isomorphism if 

(a) (j) is bijective and 

(b) v > in V if and only if (f>(v) > in W, i.e. v >v+ & 4>{v) >w+- 

In other words, a linear operator V is an order isomorphism if <f> maps the cone 
V + bijectively onto the cone W + . If there exists an order isomorphism between 
V and W, we say that V and W are order-isomorphic. 

In Section 13. 1[ we have introduced the notion of a base B of a cone V+ as a 
convex subset of V+ which fully characterizes the cone structure of V+. Thus, 
one would expect that in the case where two cones are equivalent in the cone 
sense (i.e. where the two cones are order-isomorphic), one can find bases of the 
cones which are equivalent in the convex set sense (i.e. two bases which are 
convex-isomorphic) and vice versa. This is indeed the case. 

Proposition B.2: Let (V, <y+)> (W, <w+) be finite-dimensional ordered vector 
spaces with positive cones V+, W+, respectively. Assume that V+ admits a base. 
Then the following are equivalent: 
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(a) V and W are order-isomorphic. 

(b) There is a base By of V+ and a base Bw of W+ such that By and Bjy are 
convex-isomorphic, and dim V — dim W. 

Proof. We prove the two implications (a) => (b) and (b) =>■ (a) separately. 

• (a) =>■ (b): Let V be an ordered vector space such that V+ has a base By. 
Let cj> : A — > B be an order isomorphism from V to some other ordered 
vector space W. V + is a base of V, so by Theorem l3.12[ there is a strictly 
positive linear functional / 6 V* and a a > such that 

By = {v G V+ | /(v) = a} . (B.l) 

Let g be the linear functional on W given by g(w) — (/ o _1 )(tt;) for 
all w E W. The functional g is obviously strictly positive on W, so 
Bw = {w E VF+ | g(iu) = a} is a base for W+ by Theorem 13.121 Now we 
show that By and Bw are convex-isomorphic. We have that 

4(B v )^{4(v)\veV + J(v)=a} 
= {weW+\3veV+:v = 4>-\w),f{v) = f{cj>- X {w)) = a} . 

V v ' 

W 

By the definition of an order-isomorphism, <f>\v + '■ V+ -^ W+ is bijective, 
so (•) is trivially satisfied for all w € W + . This means that 

4>{B V ) = {weW+\(fo (F 1 )^) = g(w) =a} = B w . 

The order-isomorphism <f> is linear and therefore convex-linear, so By and 
Biy are convex-isomorphic. The spaces V and W have the same dimension 
by the definition of an order-isomorphism. 

• (b) => (a): Let V+ be the positive cone of an ordered vector space V with 
base By- Let W+ be the positive cone of some other ordered vector space 
W with base Bw- Assume that there is a convex- linear map \ : V —> W 
such that x(&v) — Bw and Xspan(e v ) ^ s bijective (i.e. By and Bw are 
convex-isomorphic). We have assumed that dimV = dimW, so we can 
assume that x 1S bijective. According to Theorem 12. 19[ there is a linear 
map L : V ~^W and a vector c £ W such that x is given by xW) — L(v)+c 
for all v E V. The map x is bijective, so L must be invertible. In the 
following, we construct a map <p : V ^>- W and show that it is an order- 
isomorphism. 



By Theorem [3321 there is a linear functional / e V* such that 

By = {v e V+ | /(«) = 1} . (B.2) 

Let by £ By be any vector in the base of V+. We have that /(fry) = 1, 
so &y ^ ker(/). Thus, any vector v £ V can be expressed as a unique 
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linear combination v = «kcr + aby, where v kcr E ker(/). We define the 
map cf> : V — > W by 

(j>(v) = 0(^kor + aby) = L(ukcr) + a(£(6y) + c) . 
The map (/> is linear since is the linear extension of the map 




■"\;uksrj for all v kcl - E ker(/) 

L(by) + c for the particular vector 6y 



To show that <j> is an order isomorphism, we have to show that (i) <j) is 
bijective and that (ii) v >v + if and only if <f>(v) >w + 0. To show these 
two properties, it is useful to notice that the restriction <J>\b v °f 4> t° &v 
coincides with x\b v - To see this, let v E By. We said that any vector 
v E V can be decomposed as v = v kcl + abv, where v kc r S ker(/). We 
assume that v E By so by (|B.2|) , we have that 1 = f(v) — f(aby +Vker) = 
af(by) = a and thus u = by + fker- This implies 

tp(v) = <p(v B + Uker) = 0(«b) + <H«kcr) = L(v B ) + C + L(v kcr ) 

= £(w B + u kcr ) + c = £(«) + c 
= X(w) ■ 

Now we show that 

(i) <fi is bijective: Note that by Theorem 13. 121 there is a linear functional 
g E W* such that By/ = {w E W+ | #(w) = 1}. From 4>\b v = x\b v i* 
follows that the functional / satisfying (|B.2|) can be chosen to be / = 
g o(f)\^} For any u kcr E ker(/), we have that g(4>(v kcl )) = f{v kcl ) = 0, 
so <j) maps ker(/) bijectively to ker(g) (since L is bijective). It remains 
to be shown that 4>{by) £ kev(g). This follows from g(<j)(by)) = 
f(by) = 1, so 4» is bijective. 

(ii) v > v+ «• 4>(v) > w+ 0: Trivially, we have that v — ■&■ <f>(v) = 0. 
For v >y + =>• 4>(v) >w+ 0, note that we have shown above that 
4>\b v — x\b v - The set By is a base of V+, so if u >y + (i.e.v E 
V+\{0}), then v — a v b v for some a v > 0, 6„ £ By. Therefore, 
w >y + implies 

0(t>) = a v xQ>v) >w + by property (1X71) . 

>o es vr 

By the bijectivity of </> we can make the same argumentation in the 
reverse direction to get that <f>{v) >w + =>• v >y + 0. This completes 
the proof. D 



^ote that if span(S^) ^ V, the functional / is not unique. 
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The idea we want to develop in the following is that a set of normalized 
states, i.e. a compact convex subset of a finite-dimensional real vector space, 
gives rise to a "unique" abstract state space. In this context, "unique" means 
unique up to order-isomorphism. Vice versa, we will see that every abstract state 
space gives rise to a unique compact convex subset of a finite-dimensional vector 
space. Thus, the two structures are in a one-to-one-correspondence. Before we 
can prove this correspondence, we need to prove three lemmas. 

Lemma B.3: Let H be an afhne hyperplane in a vector space V . Then the 
following two statements are equivalent: 

(a) span(ff) = V. 

(b) The zero vector is not contained in H. 

Proof. We prove the two implications (a) =4> (b) and (6) => (a) separately. 

• (a) => (b): Suppose that the zero vector is contained in H. By the def- 
inition of an afhne hyperplane, there is a linear functional / G V* and a 
k G M such that H = {v G V \ f(v) = k}. The zero vector is in H, so 
k = since /(0) = by the linearity of /. Let x G span(iJ), i.e. 

x = > otiVi for some numbers a» G R and some vectors i>i G H. 

i 

Then 

so x G H and therefore span(i7) = H ^ V. 

• (b) =>- (a): Suppose that the zero vector is not in H, i.e. the zero vector 
is affincly independent of H. It is easy to verify that 

aff (if U {0}) = aff (aff (H) U {0}) = {av\a£R,v £ aff (H)} = span(#) . 

The zero vector is affincly independent of H, so aff (H U {0}) = V since 
H is an afhne hyperplane in V (and therefore has codimension one). This 
implies that span(_ff) — V. □ 

Lemma B.4: Let C C V be a convex subset of a finite-dimensional real vector 
space V. Then there is a convex subset C of some finite-dimensional real vector 
space W such that the following properties are satisfied: 

• C and C" are convex-isomorphic, 

• aff(C') is an afhne hyperplane of W, 



• 



span(C') = W. 



97 



Proof. We perform a proof by cases. 

1. aff(C) is an affine hyperplane ofV: 

(a) aff (C) does not contain the zero vector: In this case, Lemma IB. 31 
applies and we see that span(C) = V. This means we can simply 
choose C = C in W = V and all required properties are satisfied. 

(b) aff(C) contains the zero vector: In this case, we translate C by a 
nonzero vector normal to aff (C) to obtain a set C which does not 
contain the zero vector and for which aff (C) is an affine hyperplane. 
The set C is convex-isomorphic to C because a translation is convex- 
linear by Theorem 12.181 Lemma IB. 31 applies to C and we see that 
span(C') = W, so C satisfies all the desired properties. 

2. aff (C) = V: In this case, consider C as a subset of V = V®R, i.e. consider 
C = {{v, 0) e V ®R \ v e C} C V . The set C is convex-isomorphic to C 
and the affine hull aff (C) of C is an affine hyperplane of V. For C, either 
case 1. (a) or 1. (b) is on hand, for which we have proved the claim. 

3. dim(aff(C)) < dimF — 2. In this case, consider C as a subset of the vector 
space span(C). Then C applies to case 1. or 2., for which we have already 
proved the claim. □ 

Lemma B.5: Let B be a convex subset of a finite-dimensional real vector space 
V. Assume that there exists a linear functional f £ V* and a k > such that 
f(v) = k for all v G B. Then the following hold: 

(a) K := {av a > 0, v e £>} is a cone and B is a base for K. 

(b) If in addition span(£>) = V, then K is generating. 
Proof. We prove the two claims separately. 

(a) We verify properties (|3.1[) . (|3.2|) and (|3.3j) to show that K is a cone. 

• (|3.ip : Let v,w £ K. We have that v = aii, w = bw for some a, b > 
and some v, w 6 B. Then 

v + w — av + bw = (a + b) tv H rw } € K . 

v ' \ a + b a + b J 



£B since B is convex 



For any a > and v G K, we have that aK £ if by the 
definition of K, so Q3.2p is satisfied. 

p. 31) : Let we i^be nonzero and let a; € K such that a; = ex for some 
c > 0. If —x G if, then —a; G B. This would contradict the assumption 
that there is a linear functional / G V* and a k > such that f(v) = k 
for all u G 23. This implies K n (-if) = {0}. 



<Ks 



Therefore, K is a cone. To see that B is a base of K, note that for every 
y € K {0}, there is a f3 > and & y E B such that y = /3y. If moreover 
y = 7-S, we have 

o = f(y)-f(y) = Pf(y)-if(2) = (P-i)k, fc>o =* /? = 7 

=*> y = «■ 

Therefore, the representation y = f3y is unique and we have that B is a basis 
for the cone K. 

(b) Assume that span(£>) = V. It is easily verified that K — K = span(iif) = 
span(£>). Thus, K is generating. □ 

With these three lemmas, we are ready to prove one direction of the equiv- 
alence between compact convex subsets of finite-dimensional vector spaces and 
abstract state spaces. It reads as follows. 

Proposition B.6: Let Ha be a set of normalized states which satisfies As- 
sumptions [TJ [2] and Then there is an abstract state space (A, A + ,ua) such 
that 0^4 :— {uj G A + \ ua{u) ~ 1} is convex-isomorphic to O^. This abstract 
state space is unique up to order-isomorphism. 

Proof. Assumptions [1] [2] and [3] imply that Ha is a compact convex subset of 
a finite-dimensional vector space. By virtue of Lemma IB.41 there is a convex 
subset VLa of a finite-dimensional real vector space A which is convex-isomorphic 
to VLa such that span^^) = A and such that afi^O^) is an affine hyperplane of 
A. A convex-linear bijection is obviously a homeomorphism, so 51 a is compact. 
It holds that span(aff {Qa)) — span(51 J 4) = A, so by Lemma fB.31 the zero vector 
is not contained in afl^f^). This implies that there exists a linear functional 
/ G V* and a k > such that f(v) — k for all cj G Ha- This allows us to 
apply Lemma [B. 5 1 to get a generating cone A + with base ft a- By Theorcm l3.I4l 
and the compactness of flA, we have that A + is closed. Theorem 13.121 implies 
the existence of a strictly positive linear functional ua with Ha — {^ G A + \ 
ua(oj) = 1}. By Theorem I3.18[ ua is an order unit. This shows the existence of 
an abstract state space (^4, A + , ua) with the claimed properties. The uniqueness 
up to order isomorphism follows from Proposition IB. 21 and the fact that being 
convex-isomorphic is a transitive relation. □ 

This establishes that the structure of a set of normalized states that satisfies 
Assumptions [IJ [2] and [3] gives rise to the structure of an abstract state space 
(unique up to order isomorphism). To have an equivalence of the two structures, 
we also want the converse. This is provided by the following theorem. 

Theorem B.7 ( |AT07[ Chapter 3.1]): If V+ is a closed and generating cone 
of a finite-dimensional real vector space V and uy is a strictly positive linear 
functional on V, then the set {v G V \ iiy(w) = 1} is a compact base for V+. 

This shows that there is a one-to-one-correspondence between 
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(a) the structure of a compact convex subset of a finite-dimensional real vector 
space and 

(b) the structure of an abstract state space. 



In section 13.21 the physical and mathematical assumptions for the set of nor- 
malized states defined structure (a), but the one-to-one-correspondence allows 
us to treat them as structure (b). This is why we can deal with abstract state 
spaces when we talk about generalized probabilistic theories. 
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Symbols and abbreviations 



Quantum theory 



H The Hilbert space associated with a quantum system 

Hcrm('H) The vector space of Hcrmitian operators on Hilbert space T-L 

p A density operator on 7i 

S(H) Set of density operators on Hilbert space T-L 

fp The linear functional p n> tr (Pp) for some Hcrmitian operator 
P cm T-L 

supp p The support of a density operator 



Convex sets, polytopes and cones 



x ,y 
ext(C) 

aff(M) 

conv(M) 

span(M) 

cone(M) 

M 

(V,<) 

v+ 

<K 

VI 



The line segment connecting the points x and y 

The set of extreme points of a convex subset C of a real vector 
space 

The affine hull of a subset M of a real vector space 

The convex hull of a subset M of a real vector space 

The linear span of a subset M of a vector space 

The conical hull of a subset M of a vector space 

The closure of a subset of a topological space 

An ordered vector space 

The positive cone of an ordered vector space V 

The cone order over a vector space V induced by a cone K C V 

The dual cone of a cone V + C V 
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Abstract state spaces 



(A,A+,ua) An abstract state space 

A, B The vector space associated with an abstract state space (of- 

ten used as a symbol for the whole triple (A, A + ,ua) or 
(B, B + ,Ub), respectively) 

A + The positive cone in an abstract state space A 

ua The distinguished order unit on an abstract state space A 

Qa The set of normalized states in an abstract state space A 

0^ The set of subnormalizcd states in an abstract state space A 

Ea The set of effects on an abstract state space A 
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Index 



abstract state space, 2H 

induced by a subset, see induced 
abstract state space 
affine hull, Q3 
affine map, [H)] 
affinely independent, [TH1 l2"01 
apex 

of a pyramid, [30l 

base 

of a cone, [35] 
of a pyramid, [3D] 

Caratheodory, theorem by, [33] 

classical theory, [51] [5UJ 

complementary effect, 1341 

cone, [33] 

generating, see generating cone 
positive, see positive cone 

cone order, [35] 

conical hull, 1331 

convex hull, [T^l 

convex set, [9] 

convex space, [8] 

convex subset, see convex set 

convex-isomorphic, 1161 

convex- linear map, 1161 

dimension 

of a face, [M] 

of a polytope, [2D] 

of an affine hull, [2UJ 
dual cone, [36] 
dual norm, 1451 
dual order, [35] 

effect, HFJ 

complementary, see complementary 
effect 

induced by a transformation, [33J 
extreme point, [TUJ 



face, [TT] 
d-face, [ 



associated with an effect f . [331 
opposite, [33] 
proper, ITTI 
facet, [H 

generating cone, l40l 

Hilbert Projection Theorem, [15] 

induced abstract state space, [72] 

Krein Milman Theorem, [93] 

line segment, [3] [7] El 
linear functional 

positive, see positive linear func- 
tional 
strictly positive, see strictly posi- 
tive linear functional 

measurement, 1481 
Minkowski, theorem by, [T3J 

operation, [64] 

associated with a pure measure- 
ment, [84] 
order interval, [33J 
order unit, 03] 

in V,M 

on V,M 
order-isomorphic, [M] 
order-isomorphism, [M] 
ordered vector space, [3J] 

partial order, [33] 

partially ordered vector space, see or- 
dered vector space 
perfectly distinguishable sets of states, 

m 

perfectly distinguishable states, [5jJ 
physical subspace, [75] 
polygon, HD] 
polyhedral set, [21] 
polyhedron, [20] 
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polytope, Q1J] 

d-polytope, [20] 
positive cone, [55] 
positive linear functional, [55J 
positive map, [551 
pure state, 03] 
pyramid, [30] 

d- pyramid, [50] 

set of effects, HFJ 
set of normalized states, [41] [44] 
set of subnormalized states, [32] 
sets of states 

perfectly distinguishable, see per- 
fectly distinguishable sets of 
states 
simplex, [531 EH 

d-simplex, l24l 
state 

pure, see pure state 
states 

perfectly distinguishable, see per- 
fectly distinguishable states 
strictly positive linear functional, 1361 
subspace 

physical, see physical subspace 

topological vector space, [T5I 
transformation, [55J 

vertex, [T5J 
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